Protein M fusion proteins and uses

ABSTRACT

Fusion proteins with immunoglobulin binding properties, their uses and related methods and compositions are disclosed. The fusion proteins are comprised of an antibody-binding fragment of protein M from  Mycoplasma  spp. conjugated to a receptor fragment. The receptor fragment is a protein fragment to which a pathogen, a toxin or a cancer cell can specifically bind. The fusion proteins can be used to neutralize or eradicate a wide group of pathogens, toxins or cancer cells.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims priority to U.S. provisional application 63/079,815 filed Sep. 17, 2020, the content of which is incorporated herein.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT

Not Applicable.

SEQUENCE LISTING ON ASCII TEXT

This patent application file contains a Sequence Listing submitted in computer readable ASCII text format (file name: DELA-02-US-Sequence-Listing.txt, date recorded: Aug. 30, 2021, size: 267,396 bytes). The Sequence Listing, which is a part of the present disclosure, includes a computer readable form and a written sequence listing comprising nucleotide and/or amino acid sequences of the present invention. The sequence listing information recorded in computer readable form is identical to the written sequence listing. The content of the Sequence Listing file is incorporated herein by reference in its entirety.

TECHNICAL FIELD

The present teachings relate to methods and compositions that utilize Protein M fusion proteins. Some of the disclosed methods and compositions relate to methods of neutralizing or eradicating various human pathogens and toxins.

INTRODUCTION

Many emerging and known pathogens continue to present a serious threat to human health and safety. In the past few decades, many infectious diseases, such as those caused by the SARS-CoV-2 virus, the human immunodeficiency virus (HIV) and others have effectively migrated from animal to human hosts and devastated entire populations and economies. Despite some successes in treatment of these pathogens, the options remain limited or not available, like in the case of the SARS-CoV-2 virus. Thus, there remains a need in the art for an efficient general method for neutralizing pathogens or clearing out pathogens from human body.

SUMMARY

The present teachings include a method for neutralizing a pathogen, wherein the pathogen has a specific binding affinity for a receptor fragment, the method comprising: providing conditions for interaction between the pathogen and a fusion protein that comprises a polypeptide having at least 90% identity over its entire length with either the sequence SEQ ID NO: 1 or the sequence SEQ ID NO: 2 conjugated to the receptor fragment, whereby the fusion protein binds to and neutralizes the pathogen.

In accordance with a further aspect, the receptor fragment is a protein fragment of a cellular receptor.

In accordance with a further aspect, the pathogen is SARS-CoV-2 virus and the receptor fragment has the sequence SEQ ID NO:15.

In accordance with a further aspect, conjugation of the polypeptide and the receptor fragment is made through a spacer.

In accordance with a further aspect, the spacer is a peptide having one of the following sequences: SEQ ID NO: 12-14.

In accordance with a further aspect, the receptor fragment comprises one of the following sequences: SEQ ID NO: 16-36.

In accordance with a further aspect, the receptor fragment has one of the following sequences: SEQ ID NO: 16-36.

In accordance with a further aspect, the fusion protein neutralizes the pathogen via recruitment of C1q protein.

The present teachings also include a method for eradicating a bloodborne pathogen in a subject, wherein the pathogen has a specific binding affinity for a receptor fragment inside the subject's body, the method comprising:

receiving a sample of blood, serum or plasma from the subject or from a donor compatible with the subject, wherein the sample comprises immunoglobulins;

adding a fusion protein that comprises a polypeptide having at least 90% identity over its entire length with either the sequence SEQ ID NO:1 or the sequence SEQ ID NO:2 conjugated to the receptor fragment to the sample, wherein the fusion protein binds to the immunoglobulins present in the sample; administrating the sample having the fusion protein bound to the immunoglobulins into the subject's body, in an amount sufficient to eradicate the pathogen in the subject.

In accordance with a further aspect, the receptor fragment comprises one of the following sequences: SEQ ID NO: 16-36.

In accordance with a further aspect, the fusion protein bound to the immunoglobulins eradicates the pathogen via recruitment of C1q protein.

The present teachings also include a fusion protein having a specific binding affinity for an immunoglobulin molecule, comprising a polypeptide having at least 90% identity over its entire length with either the sequence SEQ ID NO:1 or the sequence SEQ ID NO:2 conjugated to a fusion partner, wherein the fusion partner has a sequence that is at least 90% identical to one of the following sequences: SEQ ID NO: 15-36.

In accordance with a further aspect, conjugation of the polypeptide and the fusion partner is made through a spacer.

In accordance with a further aspect, the spacer is a cleavable peptide having one of the following sequences: SEQ ID NO: 96-98.

The present teachings also include a method for neutralizing a toxin in a subject, wherein the toxin has a specific binding affinity for a receptor fragment, the method comprising:

receiving a sample of blood, serum or plasma from the subject or from a donor compatible with the subject, wherein the sample comprises immunoglobulins;

adding a conjugated protein that comprises a polypeptide having at least 90% identity over its entire length with either the sequence SEQ ID NO:1 or the sequence SEQ ID NO:2 conjugated to the receptor fragment to the sample, wherein the conjugated protein binds to the immunoglobulins present in the sample; administrating the sample having the conjugated protein bound to the immunoglobulins into the subject's body, in an amount sufficient to eradicate the toxin in the subject.

In accordance with a further aspect, the receptor fragment comprises one of the following sequences: SEQ ID NO: 16-36.

The present teachings also include a method for detecting immunoglobulins that are present in a solution or on a solid support matrix, but not bound to their cognate antigen, the method comprising: contacting immunoglobulins with conjugated proteins in the solution, wherein each conjugated protein comprises a polypeptide having at least 90% identity over its entire length with either the sequence SEQ ID NO:1 or the sequence SEQ ID NO:2 conjugated to a detectable probe, whereby the conjugated proteins bind to immunoglobulins that are not bound to their cognate antigen; separating conjugated proteins that are bound to immunoglobulins from conjugated proteins that are not bound to immunoglobulins; detecting the conjugated proteins that are bound to immunoglobulins by utilizing the detectable probe, thereby detecting immunoglobulins that are not bound to their cognate antigen. Examples of solid support matrix include: blots, beads, microplate well, resin.

In accordance with a further aspect, conjugation of the polypeptide and the detectable probe is made through a spacer.

In accordance with a further aspect, the spacer is a cleavable peptide having one of the following sequences: SEQ ID NO: 96-98.

In accordance with a further aspect, the detectable probe is an enzyme that has a fluorogenic, luminescent or chromogenic substrate.

In accordance with a further aspect, the detectable probe is a protein having a sequence chosen from SEQ ID NO:67-69.

In accordance with a further aspect, the detectable probe is a fluorescent or a luminescent or a radioactive molecule.

In accordance with a further aspect, the detectable probe is an epitope tag having a sequence chosen from SEQ ID NO: 70-81.

In accordance with a further aspect, the detectable probe is a polypeptide having a sequence chosen from SEQ ID NO:82-85 and configured to bind streptavidin and/or avidin.

In accordance with a further aspect, the detectable probe is a polypeptide having a sequence chosen from SEQ ID NO:86-92 or from SEQ ID NO:93-94, and configured to attach to its cognate binding partner, either covalently or non-covalently.

In accordance with a further aspect, the detectable probe is a fluorescent protein having the sequence SEQ ID NO:95.

The present teachings also include a codon-optimized polynucleotide that encodes the fusion protein according to claim 14.

In accordance with a further aspect, the codon-optimized polynucleotide according to claim 29 has a sequence that is at least 95% identical to one of the following nucleic acid sequences: SEQ ID NO: 40-61.

In accordance with a further aspect, the codon-optimized polynucleotide according to claim 29 is inserted in a vector configured for replication and protein expression in mammalian cells.

These and other features, aspects and advantages of the present teachings will become better understood with reference to the following description, examples and appended claims.

BRIEF DESCRIPTION OF THE DRAWINGS

Those of skill in the art will understand that the drawings, described below, are for illustrative purposes only. The drawings are not intended to limit the scope of the present teachings in any way.

FIG. 1. A schematic representation of plasma antibodies of differing isotypes and differing specificities mixed with armY-ACE2 fusion protein. When armY-ACE2 is added to the plasma, armY-ACE2 binds and blocks the antibody antigen-binding region via the armY component (Protein M). The ACE2 component of the fusion protein endows the antibodies with a new binding specificity to the spike protein of SARS-CoV-2, allowing the antibodies to bind SARS-CoV-2 and mark for eradication.

FIG. 2. A schematic representation of one proposed route of administration of armY-ACE2 therapeutic, whereby plasma (containing antibodies) is obtained from the patient or a ABO-compatible donor through apheresis. A measured amount of armY-ACE2 is added to the plasma antibodies, which then acquire SARS-CoV-2 specificity as described in FIG. 1 and as demonstrated in Example 16, FIG. 14 below. A patient with active COVID-19 is treated with armY-ACE2 plasma antibodies and allowed to fully recover.

FIG. 3. Myc-tagged Protein M binds to human IgG antibody coated on wells in a dose-dependent fashion. Bound Protein M is detected via its myc-tag using a mouse IgG1 anti-myc antibody followed by an HRP-labeled goat anti-mouse IgG. Neither antibodies bind the human IgG antibody coated on the well (assay buffer, no Protein M).

FIG. 4A-B. Goat anti-human IgG (FIG. 4A) or Chicken anti-human IgG (FIG. 4B) is neutralized or blocked by Protein M in a dose-dependent fashion and prevented from binding their antigen human IgG, immobilized on the wells.

FIG. 5A-D. Protein M-HRP fusion protein detection of antibodies. Detection of goat F(ab′)₂ antibody fragment (FIG. 5A), dose-dependent detection of human IgG antibody (FIG. 5B) and detection of two mouse monoclonal IgG1 antibodies: anti-myc and anti-CD28 (FIG. 5C) coated on wells. FIG. 5D. Protein M-HRP indirect detection of mouse IgG1 antibody in solution. Protein M-HRP incubated with increasing concentration of mouse IgG1 antibody resulted in the loss of Protein M binding to the human IgG antibody coated on the wells. In the absence of mouse IgG1 antibody in solution, binding of Protein M-HRP to the human IgG antibody coated on the wells is not hindered.

FIG. 6A-B. Antibodies bound to antigen are not detected by Protein M. Myc-specific mouse antibody bound to myc-tagged protein coated on wells in (FIG. 6A) or ASIP-specific rabbit antibody bound to ASIP protein coated on wells in (FIG. 6B) failed to be detected by Protein M-HRP. This is consistent with the described function of Protein M and its inability to bind antibodies already engaged in a complex with their cognate antigen. Presence of antibodies bound to their coated antigen was confirmed using detecting HRP-labeled anti-mouse IgG (A) and anti-rabbit IgG-biotin/streptavidin-HRP (B).

FIG. 7A-B). FIG. 7A. Biotinylated irrelevant (non-specific) antibody binds to SARS-CoV-2 spike protein coated on the wells only when the antibody is in a complex with armY-ACE2. ACE2 domain mediates binding between the spike protein and armY-bound antibody since antibody with Protein M lacking the ACE2 domain is unable to bind the spike protein coated on the well. As expected, antibody alone does not bind the SARS-CoV-2 spike protein, which requires a physical association with armY-ACE2 that binds the spike protein coated on the well. Moreover, armY-ACE2 alone does not produce any signals since the presence of signal requires the interaction between the biotinylated antibody and armY-ACE2 bound to the spike protein coated on the well. FIG. 7 B. Anti-histidine (his) tag was used to detect the his (6× histidine)-tagged SARS-CoV-2 spike protein in a complex with armY-ACE2. Although, not included in this experiment, SARS-CoV-2 spike protein does not bind to human IgG (previous observation), a complex between armY-ACE2 and SARS-CoV-2 spike protein was required for detection by anti-his tag antibody. Myc-specific antibody detected myc-tagged armY-ACE2 binding to human IgG coated wells regardless of whether it was bound or not bound to SARS-CoV-2 spike protein.

FIG. 8. Purified human IgG or antibodies in human serum formed a complex with armY-ACE2 in solution preventing army-ACE2 binding to antibody coated on the wells. As expected, in the absence of antibody in solution binding of armY-ACE2 to the antibody coated wells was not prevented.

FIG. 9. armY-ACE2 engaged antibody binds to K562 cells expressing FcγRII receptor (right panel, army-ACE2+antibody). No binding to K562 cells was observed in the absence of antibody (left panel, army-ACE2 alone, no antibody), therefore, the observed binding of armY-ACE2 to K562 cells is dependent upon the association between the antibody and armY-ACE2.

FIG. 10. Binding of [armY-ACE2+antibody] to K562 cells is prevented by blocking FcγRII receptor using anti-CD32 (IV.3) (left panel). Binding is not blocked by the isotype-match antibody (right panel), demonstrating that armY-ACE2 engaged antibody maintains Fc-receptor binding activity.

FIG. 11A-B. FIG. 11A. Only a complex between armY-ACE2 and mouse IgM resulted in binding to immobilized human C1q complement component. FIG. 11B. Binding of [armY-ACE2+mouse IgG1] complexes to immobilized human C1q complement component is inhibited by pre-incubation of the complex with soluble human C1q, which also suggest that army-ACE2 primes mouse IgG1 antibody to bind C1q in solution. A complex between Protein M (lacking ACE2) with mouse IgG1 did not result in binding to C1q, suggesting the requirement of the fusion partner domain, ACE2, to induce a conformation resulting in antibody binding to C1q.

FIG. 12A-B. FIG. 12A. armY-ACE2 exhibits ACE2 activity in a dose dependent fashion. FIG. 12B. [armY-ACE2+antibody] complexes exhibit ACE2 activity comparable to armY-ACE2 alone, suggesting that binding of armY-ACE2 to antibodies does not interfere with the enzymatic function of ACE2.

FIG. 13A-B. FIG. 13A. Diagram of armY-ACE2 construct showing a myc-tag at the N-terminus, followed by the human ACE2, a linker and Protein M “armY” at the C-terminus. FIG. 13B. Photograph of the SDS-PAGE gel of purified non-reduced (left lane) and reduced (right lane) of armY-ACE2 showing a ˜180 kDa protein band.

FIG. 14A-C. FIG. 14A. Non-immune serum antibodies armed with armY-ACE2 gain the ability to bind to SARS-CoV-2 spike protein. Unarmed non-immune serum (pre-vaccine) does not bind to SARS-CoV-2 spike protein (a) but gain the ability to bind after incubation with armY-ACE2 (b). Approximately one month post-vaccination with the Moderna Covid19 vaccine, serum antibodies bind to the SARS-CoV-2 spike protein coated on the well as expected and served as an assay positive control (c). The assay does not detect armY-ACE2 alone when added to the SARS-CoV-2 spike protein coated wells (d), suggesting a requirement for serum antibodies to be in a stable complex with armY-ACE2 to bind the SARS-CoV-2 spike protein for assay detection. The photo to the right, representative of the duplicate wells, shows the corresponding SARS-CoV-2 spike protein coated wells 20 minutes after the addition of the mixtures, followed by addition of detecting antibody and addition of the TMB substrate, which give rise to the appearance of the blue color product indicative of antibody presence bound to the SARS-CoV-2 spike protein. FIG. 14B. Non-immune plasma (anticoagulant: ACD-A) antibodies armed with armY-ACE2 also gain the ability bind to SARS-CoV-2 spike protein, comparable to non-immune serum antibodies armed with armY-ACE2. As expected, unarmed serum or plasma antibodies do not bind to SARS-CoV-2 spike protein. FIG. 14C. Less than 1 ug/ml of free-unengaged army-ACE2 remain detectable after a 60 minutes incubation with either serum- or plasma-antibodies at 37° C., suggesting at least 95% of armY-ACE2 added (20 ug/ml) readily engage and arm antibodies in solution.

FIG. 15A-B. FIG. 15A. Monoclonal antibody (mAb) armed with armY-ACE2 gains the ability to bind the SARS-CoV-2 spike protein, while in FIG. 15B mAb is no longer able to bind its natural target antigen.

FIG. 16. Specific detection of antibody light-chain (LC, ˜25 KDa), but not heavy-chain (HC, ˜50 KDa) by mono-biotinylated protein M on 1D gel electrophoresis by Western blot analysis. Antibody sample was loaded alone (right lane, Ab) or in a mixture with E. coli lysate (middle lane, E+Ab). E. coli lysate alone was also loaded (left lane, E) as control. The molecular weight standard values (KDa, left) are derived from the Coomassie blue stained blots.

FIG. 17. Biotinylated Protein M is immobilized in streptavidin-coated wells and serves as a surrogate antigen for a monoclonal antibody. Increasing amount of antibody is added to the wells and the level of bound antibody is measured in an ELISA-based method.

FIG. 18. Protein M fusion blocks the binding of a monoclonal antibody (mAb) to its natural antigen thereby a) shows that antigen binding is Fab dependent and b) confirms the antibody's binding specificity.

DETAILED DESCRIPTION Abbreviations and Definitions

Unless otherwise noted, technical terms are used according to conventional usage. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety for all purposes. To facilitate understanding of the invention, a number of terms and abbreviations as used herein are defined below as follows.

The terms “polypeptide”, “protein” and “peptide” are used herein interchangeably to refer to amino acid chains in which the amino acid residues are linked by peptide bonds or modified peptide bonds. The amino acid chains can be of any length of greater than two amino acids. Unless otherwise specified, the terms “polypeptide”, “protein” and “peptide” also encompass various modified forms thereof. Such modified forms may be naturally occurring modified forms or chemically modified forms. Examples of modified forms include, but are not limited to, glycosylated forms, phosphorylated forms, myristoylated forms, palmitoylated forms, ribosylated forms, acetylated forms, and the like. Modifications also include intra-molecular crosslinking and covalent attachment of various moieties such as lipids, flavin, biotin, polyethylene glycol or derivatives thereof, and the like. In addition, modifications may also include cyclization, branching and cross-linking. Further, amino acids other than the conventional twenty amino acids encoded by genes may also be included in a polypeptide. The term “polypeptide” or “protein” may also encompass a “purified” polypeptide that is substantially separated from other polypeptides in a cell or organism in which the polypeptide naturally occurs (e.g., 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, 100% free of contaminants).

Conservative changes: As used herein, when referring to mutations in a nucleic acid molecule, “conservative changes” are those in which at least one codon in the protein-coding region of the nucleic acid has been changed such that at least one amino acid of the polypeptide encoded by the nucleic acid sequence is substituted with another amino acid having similar characteristics. Examples of conservative amino acid substitutions are ser for ala, thr, or cys; lys for arg; gin for asn, his, or lys; his for asn; glu for asp or lys; asn for his or gin; asp for glu; pro for gly; leu for ile, phe, met, or val; val for ile or leu; ile for leu, met, or val; arg for lys; met for phe; tyr for phe or trp; thr for ser; trp for tyr; and phe for tyr.

Isolated polypeptide: The term “isolated polypeptide” as used herein means a polypeptide molecule is present in a form other than found in nature in its original environment with respect to its association with other molecules. The term “isolated polypeptide” encompasses a “purified polypeptide”, which is used herein to mean that a specified polypeptide is in a substantially homogenous preparation, substantially free of other cellular components, other polypeptides, viral materials, or culture medium, or when the polypeptide is chemically synthesized, substantially free of chemical precursors or byproducts associated with the chemical synthesis. For a purified polypeptide, preferably the specified polypeptide molecule constitutes at least 15 percent of the total polypeptide in the preparation. A “purified polypeptide” can be obtained from natural or recombinant host cells by standard purification techniques, or by chemical synthesis.

An “isolated” biological component (such as a nucleic acid molecule, protein, or virus) has been substantially separated or purified away from other biological components (e.g., other chromosomal and extra-chromosomal DNA and RNA, proteins and/or organelles). Nucleic acids, proteins, and/or viruses that have been “isolated” include nucleic acids, proteins, and viruses purified by standard purification methods. The term also embraces nucleic acids, proteins, and viruses prepared by recombinant expression in a host cell, as well as chemically synthesized nucleic acids or proteins. The term “isolated” (or purified) does not require absolute purity; rather, it is intended as a relative term. Thus, for example, an isolated or purified nucleic acid, protein, virus, or other active compound is one that is isolated in whole or in part from associated nucleic acids, proteins, and other contaminants.

As used herein, the term “vector” refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. The term “vector” comprises an “expression vector”, e.g. a vector that is capable of directing the expression of genes to which they are operatively linked. The vector often includes sequences that effect the expression of a desirable molecule, e.g., a promoter, a coding region and a transcriptional termination sequence. An expression vector can be an integrative vector (i.e., a vector that can integrate into the host genome), or a vector that does not integrate but self-replicates, in which case, the vector includes an origin of replication which permits the entire vector to be reproduced once it is within the host cell. Preferred vectors are those capable of autonomous replication and/expression of nucleic acids to which they are linked.

Nucleic acid molecules encoding fusion proteins are also within the scope of the invention. Such nucleic acids can be made by preparing a construct (e g., an expression vector) that expresses a fusion protein when introduced into a suitable host. For example, such a construct can be made by ligating a first polynucleotide encoding a single-domain antibody, or fragment or variant thereof, fused in frame with a second polynucleotide encoding another protein such that expression of the construct in a suitable expression system yields a fusion protein. Polynucleotides that encode fusion proteins can be present in isolation, or can be inserted in a vector for expression in cells. Such vector may be suitable for replication and protein expression in bacterial, mammalian or insect cells. Polynucleotides that encode fusion proteins can be codon-optimized for expression in particular type of cells by standard methods known in the art.

A “codon-optimized” nucleic acid or polynucleotide refers to a nucleic acid sequence that has been altered such that the codons are optimal for expression in a particular system (such as a particular species or group of species). For example, a nucleic acid sequence can be optimized for expression in mammalian cells or in a particular mammalian species (such as human cells). Codon optimization does not alter the amino acid sequence of the encoded protein.

The term “neutralizing a pathogen” used herein is synonymous to “inactivating a pathogen” and means that the pathogen will no longer be able to interact with a specific receptor molecule either in vitro or in vivo, or will no longer be able to infect cells of an organism.

The term “neutralizing a toxin” used herein is synonymous to “inactivating a toxin” and means that the toxin will no longer be able to interact with its target, either in vitro, or in a subject's body.

The term “eradicating a pathogen” used herein refers to neutralizing the pathogen in a subject.

As used herein, the term “Protein M” or “armY” refers to antibody-binding fragment of protein from Mycoplasma genitalium that has an amino acid sequence SEQ ID NO:1 (Grover R K, et al., Science, 2014), or to antibody-binding fragment of protein from Mycoplasma pneumoniae that has an amino acid sequence SEQ ID NO:2 (Blötz C, et al., Front Microbiol. 2020), or to a polypeptide with immunoglobulin-binding activity having a sequence with at least 90% identity over its entire length to one of the following sequences: SEQ ID NO: 3-8. In some embodiments, the term “Protein M” or “armY” also includes an immunoglobulin-binding fragment of Protein M from Mycoplasma genitalium or Mycoplasma pneumoniae.

As used herein, the term “ACE2” refers to the human cellular angiotensin-converting enzyme 2 receptor.

As used herein, the term “fusion protein” refers to an artificial, non-natural polypeptide that consists of at least two unrelated covalently linked polypeptides. The linkage between these polypeptides can be of different nature, including a peptide bond, a short flexible amino acid spacer, or a spacer of another type. The spacer joins the polypeptides together, yet preserves some distance between the polypeptides such that both polypeptides can properly fold independently.

The term “immunoglobulin,” “Ig” or “antibody” (used interchangeably herein) refers to a glycoprotein formed in response to administration of bacteria, viruses or other antigens to a mammalian organism, said glycoprotein has the ability to specifically bind cognate antigen and consists of two heavy (H) chains and two light (L) chains connected and stabilized by interchain disulfide bonds. Immunoglobulins or antibodies may be monoclonal or polyclonal and may exist in monomeric or polymeric form, for example. IgM antibodies which exist in pentameric form and/or IgA antibodies which exist in monomeric, dimeric or multimeric form. The term “fragment” refers to a part or portion of an antibody or antibody chain comprising fewer amino acid residues than an intact or complete antibody or antibody chain. Fragments can be obtained via chemical or enzymatic treatment of an intact or complete antibody or antibody chain. Fragments can also be obtained by recombinant means. Exemplary fragments include Fab, Fab′, F(ab′)₂, Fc and/or Fv fragments.

The term “antigen-binding fragment” refers to a polypeptide portion of an immunoglobulin or antibody that binds an antigen or competes with intact antibody (i.e. with the intact antibody from which they were derived) for antigen binding (i.e. specific binding). Binding fragments can be produced by recombinant DNA techniques, or by enzymatic or chemical cleavage of intact immunoglobulins. Binding fragments include Fab, Fab′, F(ab′)₂, Fv, single chains, and single-chain antibodies.

As used herein, the term “toxin” refers to an endogenous entity or exogenous substance that is harmful to a subject (preferably, human subject). Examples of harmful endogenous entities are excessive inflammatory cytokines that may be produced during a cytokine storm in the subject. A harmful endogenous entity can be soluble or membrane bound. Examples of harmful exogenous substances are Botulinum neurotoxin A, Botulinum neurotoxin B, Staphylococcal enterotoxin A and B, Staphylococcal enterotoxin A, Staphylococcal enterotoxin B, Clostridium perfringens Epsilon toxin (ETX), Ricin, Anthrax.

As used herein, the term “donor compatible with the subject” refers to a human subject having compatibility for a blood transfusion (compatibility based on ABO blood groups, Rh Type).

As used herein, the term “receptor fragment” refers to a fragment of a protein to which a pathogen (usually, a protein from the pathogen's coat) or a toxin has a specific binding affinity, or can specifically bind. Preferably, receptor fragment is a protein fragment of a cellular receptor that the pathogen or toxin binds to and utilizes to enter the cell. Preferably, receptor fragment is located inside a subject's body.

Unless otherwise defined, technical and scientific terms used in the present teachings described herein shall have the meanings that are commonly understood by those of ordinary skill in the art. Further, unless otherwise required by context, plural terms shall include the singular and singular terms shall include pluralities. Generally, nomenclatures utilized in connection with molecular biology, cell and tissue culture, protein and oligo- or polynucleotide chemistry described herein are well-known and commonly used in the art. Standard techniques are used, for example, for recombinant nucleic acid and protein preparation, purification and analysis, for oligonucleotide synthesis. Purification techniques and enzymatic reactions are performed according to manufacturer's specifications or as described herein or as commonly accomplished in the art. The techniques and procedures described herein are generally performed according to conventional methods well known in the art and as described in various general references that are cited and discussed throughout the instant specification. See, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual (Third ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. 2000). The nomenclatures utilized in connection with, and the laboratory procedures and techniques described herein are those well-known and commonly used in the art.

The present invention is directed to methods and compositions for inactivating or eliminating a pathogen, preferably a bloodborne pathogen having a specific binding affinity for a receptor fragment, by utilizing a fusion protein that comprises Protein M and the receptor fragment. Preferably, Protein M is chosen from an extracellular domain of Mycoplasma genitalium protein (Grover R K, et al., Science, 2014; SEQ ID NO: 1) or an extracellular domain of Mycoplasma pneumoniae protein (Blötz C, et al., Front Microbiol. 2020; SEQ ID NO: 2) that strongly bind to immunoglobulin molecules (antibodies). Typical binding affinities (K_(d)) of Protein M to immunoglobulin molecules are from 1.2 to 5.2 nM (Grover R K, et al., Science, 2014).

Orthologs of Protein M can be found in several related species of Mycoplasma: M. penetrans, Mycoplasma tullyi, Mycoplasma iowae, Mycoplasma imitans, Mycoplasma alvi and M. gallisepticum (disclosed herein in the Sequence listing). These sequences are also disclosed herein and can be used to create fusions or fusion proteins according to the present invention. Protein M is functionally similar to other bacterial-derived proteins that bind antibodies (e.g., protein A, protein G and protein L) with the exception that Protein M blocks the antibody's binding site and prevent it from binding its cognate antigen. Therefore, harnessing the antibody binding property of Protein M, it can be used to couple any attached compounds (e.g., genetic fusion or chemical conjugation) to an antibody regardless of the antibody's specificity. Consequently, interaction with the Protein M fusion protein will result in the loss of the antibody's specificity and acquire the specificity as that of the attached compound. The properties of Protein M fusion protein with the compound will be a combination of the antibody's stability, antibody's functional properties (such as ability to engage Fc receptors on immune cells, activate the complement system, an increased binding avidity and the compound properties (affinity to a pathogen).

Preferred nucleic acid molecules for use in the invention are polynucleotides that encode fusion proteins shown herein in the appended Sequence Listing. Nucleic acid molecules utilized in the present invention may be in the form of RNA or in the form of DNA (e.g., cDNA, genomic DNA, and synthetic DNA). The nucleic acid molecule may be double-stranded or single-stranded, and if single-stranded may be the coding (sense) strand or non-coding (anti-sense) strand. The coding sequence which encodes a fusion Protein May be identical to one of the nucleotide sequences provided in the appendices, or it may also be a different coding sequence which, as a result of the redundancy or degeneracy of the genetic code, encodes the provided fusion protein.

In some embodiments, variant fusion proteins displaying substantial differences in structure can be generated by making nucleotide substitutions that cause less than conservative changes in the encoded polypeptide. Examples of such nucleotide substitutions are those that cause changes in (a) the structure of the polypeptide backbone; (b) the charge or hydrophobicity of the polypeptide; or (c) the bulk of an amino acid side chain. Nucleotide substitutions generally expected to produce the greatest changes in protein properties are those that cause non-conservative changes in codons. Examples of codon changes that are likely to cause major changes in protein structure are those that cause substitution of (a) a hydrophilic residue, e.g., serine or threonine, for (or by) a hydrophobic residue, e.g., leucine, isoleucine, phenylalanine, valine or alanine; (b) a cysteine or proline for (or by) any other residue; (c) a residue having an electropositive side chain, e.g., lysine, arginine, or histidine, for (or by) an electronegative residue, e.g., glutamic acid or aspartic acid; or (d) a residue having a bulky side chain, e.g., phenylalanine, for (or by) one not having a side chain, e g., glycine.

Sequence Identity: As used herein, the term “sequence identity” means the percentage of identical subunits at corresponding positions in two sequences when the two sequences are aligned to maximize subunit matching, i.e., taking into account gaps and insertions. Sequence identity is present when a subunit position in both of the two sequences is occupied by the same nucleotide or amino acid, e.g., if a given position is occupied by an adenine in each of two DNA molecules, then the molecules are identical at that position. For example, if 7 positions in a sequence of 10 nucleotides in length are identical to the corresponding positions in a second 10-nucleotide sequence, then the two sequences have 70% sequence identity. Sequence identity of a polynucleotide is typically measured using sequence analysis software (e.g., the Sequence Analysis Software Package of the Genetics Computer Group, University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, Wis. 53705).

In preferred embodiments, variant fusion proteins displaying only non-substantial or negligible differences in structure can be generated by making nucleotide substitutions that cause only conservative amino acid changes in the encoded polypeptide. By doing this, fusion protein variants that comprise a sequence having at least 90% (90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, and 99%) sequence identity with the fusion protein sequences provided in the attached appendices, and retain at least one functional activity, e g., immunoglobulin binding activity. The invention also covers non-naturally occurring polynucleotides or variants that encode the fusion protein variants having at least 90% sequence identity over the entire length with the fusion protein sequences provided in the attached appendices, and retain at least one functional activity, e g., immunoglobulin binding activity. Methods of making targeted amino acid substitutions, deletions, truncations, and insertions are generally known in the art. For example, amino acid sequence variants can be prepared by mutations in the DNA. Methods for polynucleotide alterations are well known in the art, for example, Kunkel et al. (1987) Methods in Enzymol. 154:367-382; U.S. Pat. No. 4,873,192 and the references cited therein.

Therapeutically Effective Amount: As used herein, the term “therapeutically effective amount” refers to those amounts that, when administered to a particular subject in view of the nature and severity of that subject's disease or condition, will have a desired therapeutic effect, e.g., an amount which will cure, prevent, inhibit, or at least partially arrest or partially prevent a target disease or condition. By other words, this is an amount of an agent or composition that alone, or together with a pharmaceutically acceptable carrier or one or more additional agents, induces the desired response. Effective amounts of a therapeutic agent can be determined in many different ways, such as assaying for a reduction in symptoms or improvement of physiological condition of a subject. Effective amounts also can be determined through various in vitro, in vivo, or in situ assays.

In some embodiments, variants of fusion proteins having a reduced immunogenicity in humans may be generated by making amino acid substitutions in the fusion proteins that remove or modify human T-cell or B-cell epitopes present in said fusion protein. Fusion proteins that have less potential human T-cell or B-cell epitopes in the sequence are less prone to activate an unwanted immune response in a subject. The unwanted immune response includes development of anti-fusion protein antibodies that may neutralize said fusion protein. Several methods for identifying, modifying and removing potential human T-cell or B-cell epitopes in protein sequences are known and disclosed in, for example, Jawa V, Terry F, Gokemeijer J, et al. T-Cell Dependent Immunogenicity of Protein Therapeutics Pre-clinical Assessment and Mitigation-Updated Consensus and Review 2020. Front Immunol. 2020; 11:1301; Mazor R, Crown D, Addissie S, Jang Y, Kaplan G, Pastan I. Elimination of murine and human T-cell epitopes in recombinant immunotoxin eliminates neutralizing and anti-drug antibodies in vivo. Cell Mol Immunol. 2017; 14(5):432-442; U.S. Ser. No. 10/751,397 B2, US2018161419A1, the contents of which are incorporated herein by reference in its entirety.

Disclosed herein are methods for making and using fusion proteins that comprises amino acid sequences of Protein M or amino acid sequences that are at least 90% identical over the entire length with the sequences of Protein M. An example of such fusion protein is armY-ACE2, which consists of the Protein M sequence fused to the sequence of the ACE2 receptor, or to a fragment of the ACE2 receptor to which the envelope spike S protein of the SARS-CoV-2 virus is bound. Fusion protein armY-ACE2 can bind to immunoglobulin molecules of different classes, blocking their original specificity and instead directing them to interact with the envelope spike S protein of the SARS-CoV-2 virus (FIG. 1). As a result, the SARS-CoV-2 virus will be no longer capable of infecting human cells via its envelope spike S protein, and will be eliminated by macrophages that recognize immunoglobulin-bound targets, and by engaging via a complement factor activated by the bound immunoglobulins, or by other mechanisms. By utilizing knowledge of specific cellular receptors recognized by pathogens (virus or microorganism) and toxins for cellular entry, various armY-fusion proteins may be created and utilized according to the present invention. To make armY-fusion proteins, various fragments of the receptor may be used, including, without restriction, a full extracellular domain of the receptor or a fragment of the receptor which is necessary and sufficient for interaction with the pathogen or toxin.

Non-limiting examples of pathogens and toxins and their cellular attachment receptors suitable to make armY-fusion proteins are listed as follows: (a) armY-ACE2 (Angiotensin-converting enzyme 2) for the SARS-CoV and SARS-CoV-2, as well as human coronavirus NL63/HCoV-NL6; (b) armY-CD209 (DC-SIGN) for HIV-1, HIV-2, Ebolavirus, Cytomegalovirus, HCV, Dengue virus, Measles virus, Herpes simplex virus 1, Influenza virus, SARS-CoV, Japanese encephalitis virus, Lassa virus, Respiratory syncytial virus, Rift valley fever virus, West-nile virus, Marburg virus, Uukuniemi virus, and Yersinia Pestis; (c) armY-C-type lectin domain family 4 member M for Ebolavirus, Hepatitis C virus, HIV-1, Human coronavirus 229E, Human cytomegalovirus/HHV-5, Influenza virus, SARS-CoV, West-nile virus, Japanese encephalitis virus, Marburg virus glycoprotein, and M. bovis; (d) armY-CD4 for HIV; (e) armY-Synaptic vesicle glycoprotein 2A for the C. botulinum neurotoxin type A2 (BoNT/A, botA); (f) armY-Synaptic vesicle glycoprotein 2B for the C. botulinum neurotoxin type A2 (BoNT/A, botA). Probably also for the closely related C. botulinum neurotoxin type A1; (g) armY-Synaptic vesicle glycoprotein 2C for C. botulinum neurotoxin type A (BoNT/A, botA) and C. botulinum neurotoxin type A2; (h) armY-Synaptotagmin I for C. botulinum neurotoxin type B (BoNT/B, botB); (i) armY-Synaptotagmin II for C. botulinum neurotoxin type B (BoNT/B, botB); (j) armY-HLA class II histocompatibility antigen, DRB1 beta chain for Epstein-Barr virus and Staphylococcal enterotoxin A and B; (k) armY-HLA class II histocompatibility antigen, DR alpha chain for Epstein-Barr virus BZLF2/gp42, Staphylococcus aureus enterotoxin A/entA, enterotoxin B/entB, enterotoxin C1/entC1, enterotoxin D/entD, and enterotoxin H/entH; (1) armY-T cell receptor beta variable 7-9 for Staphylococcus aureus enterotoxin A/entA; (m) armY-T cell receptor beta variable 19 for Staphylococcus aureus enterotoxin B/entB; (n) armY-Hepatitis A virus cellular receptor 1 for Hepatitis A virus, Ebola virus, Marburg virus and Dengue virus and Clostridium perfringens Epsilon toxin (ETX); (o) armY-Myelin and lymphocyte protein for Clostridium perfringens Epsilon toxin (ETX); (p) armY-Complement factor H for Streptococcus pneumoniae, Neisseria meningitides, Staphylococcus aureus, Borrelia burgdorferi and West nile virus; (q) armY-Hepatocyte growth factor receptor for Listeria monocytogenes internalin InlB; (r) armY-Membrane cofactor protein (CD46) for Adenovirus subgroup B2 and Ad3, Measles virus, Herpesvirus 6/HHV-6, Neisseria and Streptococcus pyogenes; (s) armY-Glycophorin-A for Plasmodium falciparum, Influenza virus, Hepatitis A virus (HAV), Streptococcus gordonii; (t) armY-C-type lectin domain family 4 member K (Langerin, CD207) for Candida species, Saccharomyces species, Malassezia furfur, human immunodeficiency virus-1 (HIV-1) and Yesinia pestis; (u) armY-Anthrax toxin receptor 1 for Anthrax toxin; and (v) armY-Anthrax toxin receptor 2 for Anthrax toxin.

In some embodiments, codon-optimized polynucleotides are disclosed that contain a nucleic acid sequence at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identical to SEQ ID NO: 40-61. These polynucleotides are codon-optimized for expression in human cells.

Taking as an example armY-ACE2 fusion protein and SARS-CoV-2 as a pathogen, several advantages of the armY-ACE2 approach can be shown over the other known potential virus inactivating strategies, such as (a) monoclonal antibody (mAb) therapy; (b) ACE2 or ACE2-Fc fusion proteins therapy; (c) Convalescent plasma antibody therapy, and (d) anti-viral vaccine.

As to (a), mAb therapy is subject to viral escape due to a mutation in a targeted viral epitope. Most viruses possess a high mutation rate; after a mutation in the mAb-recognizing area the mAb therapy is no longer effective, and mutated viruses will proliferate and eventually will be enriched. Instead, armY-ACE2 will bind the SARS-CoV-2 virus regardless of any mutation, because all SARS-CoV-2 viruses bind ACE2 for entry into human cells. Also, since armY can bind all antibody isotypes, armY-ACE2 can arm all antibody isotypes with the capacity to target SARS-CoV-2 viruses, hence mimicking a generalized antibody-mediated immune response.

As to (b), ACE2 monotherapy suffers from rapid renal clearance due to the small size of ACE2. ACE2-Fc fusion proteins is of a single isotype, usually, an IgG. It is known that other isotypes e.g., IgM, IgA are also efficacious in pathogen clearance. Thus, armY-ACE2 can arm all isotypes or a specific isotype with the capacity to target SARS-CoV-2. In addition, Fc fusion proteins do not activate the complement system. Instead, armY-ACE2 complex with antibody maintains Fc functionality, and is able to prime the antibody to bind the C1q complement factor, a required step for complement activation. Being able to harness the full effector potential of antibodies may be critical in the overall eradication of the targeted pathogen, such as SARS-CoV-2.

As to (c), convalescent plasma therapy requires blood from donors previously exposed to SARS-CoV-2, and no longer with COVID-19 symptoms. It might take as long as 7-10 days to test for lack of blood-borne pathogens, anti-SARS-CoV-2 titer levels and ABO blood type matching requirements. Instead, armY-ACE2 could arm the patient's own plasma antibodies, and can be available to the patient in less than 2-4 hours. Donor plasma can also be used, but these can be from regular donors that have already been screened, so this could be made available to the patient even faster as long as ABO blood type and Rh type match is achieved.

As to (d), SARS-CoV-2 vaccine is prophylactic in its use and the uninfected person will require time to develop a level of protective immunity. Vaccines cannot be a therapeutic for those with on-going COVID-19. Moreover, vaccine efficacy is subject to many variables including state of health of the individual and potential side-effects, e.g., anaphylactic reaction that might hinder completion of immunization protocol. armY-ACE2 is applicable to subjects with on-going COVID-19.

The abovementioned advantages apply to other fusion proteins that are disclosed herein.

Treatment with Protein M fusion proteins changes the specificity of antibodies in plasma to a new target (e.g, a virus, bacterium or a toxin) for immune recognition and elimination; provide more optimal pharmacokinetics and activity of a compound attached to a larger more stable antibody, and improvement of bioavailability of compounds; deliver therapeutic or diagnostic compounds to an antibody-binding target (e.g., antibody binding bacteria, tissue or cell); disrupt interaction between two or more entities required for pathogenicity.

Possible routes of administration for Protein M fusion proteins include parenteral, oral and/or inhalation. In a preferred embodiment, ex-vivo plasma/serum (patient-derived or from a compatible donor) is mixed with Protein M-fusion protein and administered to patient. Preferably, Protein M fusion proteins are administered in the form of a pharmaceutical composition, comprising additional pharmaceutically acceptable excipients.

In some embodiments, Protein M fusion proteins are stored or administered in a suitable formulation that provides stability to the fusion proteins. Such formulation includes one or several pharmaceutically acceptable excipients. By “pharmaceutically acceptable” it is meant the excipient is compatible with the other ingredients of the formulation and not deleterious to the recipient thereof. Excipients for protein formulations may be picked up by methods known in the art, and may include buffers, stabilizers, antioxidants, salts, polysorbates, amino acids, among others.

Other potential uses of Protein M fusion proteins include detecting presence of antibodies and/or antibody-binding factors found in blood, tissue or cells. For example, fusing a reporter enzyme (e.g., Horseradish peroxidase, HRP) or attaching a detectable probe or label (e.g., biotin-avidin, biotin-streptavidin) to Protein M can be used to detect antibodies that are present but not bound to their cognate antigen as observed in immunoassays that exhibit “false-positive” activity and thus serve as a false-positive detection tool.

In some embodiments, Protein M can be conjugated with the following detectable probes: HRP (chromogenic), alkaline phosphatase (chromogenic), biotin (for example, via Avi-Tag peptide), myc epitope antigen, Luciferase (bioluminescence), avidin (attachment of biotin conjugates), streptavidin (attachment of biotin conjugates), streptavidin-binding peptide, phycoerythrin (fluorescence), GFP (fluorescence), a radioactive label. Protein M-radiolabel peptide can be produced by fusing Protein M to the short peptide KGRPLVY (SEQ ID NO:62). As disclosed in Mebrahtu et al. 2013, the KGRPLVY peptide contains a metal chelate attachment [K-lysine for labeling Protein M with Cu-64 and DOTA] and radio-halogen attachment (Y-tyrosine for labeling Protein M with I-125, I-123 or I-131).

In some embodiments, Protein M-detectable probe fusions can be used in ELISA, western blotting, lateral flow assays, multiplex bead array assays, pull down assays, SPR (biacore, octet) assays, flow cytometry assays, for purification or for delivery of a cargo.

Protein M fusion proteins can also be used to: 1) neutralize antibodies by occupying their antigen binding site (Useful in decreasing non-specific signals in immunoassays, useful in in-vitro cell assays as well as in in-vivo settings to determine the role of antibodies or a specific antibody by essentially blocking its binding activity); 2) eliminate antibodies by increasing clearance from circulation or tissue by directing antibodies to immune cells or delivering degrading enzymes or compounds to antibodies; 3) deplete antibodies in solution by promoting clearance of unengaged antibodies, which are not bound to antigen. Protein M can be attached to a resin (e.g., agarose beads), added to a solution to pull down/remove or harvest such antibodies for use in process, for analysis or for elimination.

Protein M fusion proteins can also be used to protect antibodies from degradation by enzymes, microbes and cellular mechanisms; protect antibodies from bacterial escape mechanisms (e.g., protein A of S. aureus binds to antibodies and avoid antibody detection and clearance); deliver cargo to an antibody.

In some embodiments, the receptor fragment is a protein fragment of a cellular receptor, which is a target used by a pathogen for cell entry. In some embodiments, the pathogen is a virus, a bacterium or a fungus that can cause illnesses. In one embodiment, an antigen is a cell surface molecule of a pathogen, or antigenic parts or fragments thereof.

A fusion protein can be made by creating a nucleic acid molecule encoding the fusion protein and expressing the fusion protein from such nucleic acid in a recombinant expression system. The nucleic acid molecule encoding the fusion can be generated by linking a nucleic acid sequence encoding Protein M in frame with a nucleic acid sequence encoding a receptor fragment of a pathogen or a ligand of a toxin. Methods for constructing a fusion protein are known in the art (see Sambrook J. et al., Molecular Cloning, Cold Spring Harbor Press, New York (2001)).

In some embodiments, Protein M is fused to the N-terminus of the receptor fragment of a pathogen or the ligand of a toxin. In this orientation, an N-terminal tag can be attached for detection and purification of the fusion protein. In addition, the leader sequence (secretory signal peptide) can be attached for facilitating the secretion of the fusion protein. Alternatively, other appropriate leader sequences, suitable for guiding the fusion protein to the ER and the secretory pathway in the host cell, can be used. In other embodiments, Protein M is fused to the C-terminus of the receptor fragment of a pathogen or the ligand of a toxin.

In still another embodiment, a spacer can be incorporated between the Protein M sequence and the receptor fragment of a pathogen or the ligand of a toxin. In preferred embodiments, spacer is a short peptide sequence that joins both polypeptides, yet preserves some distance between the polypeptides such that both polypeptides can properly fold independently. Generally, the spacer consists of between 2 or 3 amino acids to 50 amino acids, typically between 3 to 25, or 3 to 20, or 3 to 15 amino acids. In a specific embodiment, the space consists of 3-10 amino acids. Although there is no specific restriction on the selection of amino acids for the spacer region, the amino acids can be selected to accommodate the folding, net charge, hydrophobicity or other properties of the fusion protein. Typical amino acids for use in a spacer region include Gly, Ala, Ser, Thr and Asp.

One of skill would recognize that modifications can be made to a fusion protein without diminishing their biological activities. Some modifications may be made to facilitate the cloning, expression, or incorporation of the constituent molecules into a fusion protein. For example, amino acids can be placed on either terminus to create conveniently located restriction sites or termination codons; and a methionine can be added at the amino terminus to provide an initiation site.

Recombinant Expression of the Fusion Proteins.

For recombinant expression of a fusion protein, a nucleic acid molecule encoding the fusion protein is generally placed in an expression vector in an operable linkage to a promoter (such as the T7, trp, or lambda promoters for expression in bacteria, or a CMV promoter for expression in mammalian cells) and a 3′ transcription termination sequence, and optionally additional suitable transcriptional and/or translational regulatory elements such as a transcription enhancer sequence and a sequence encoding suitable mRNA ribosomal binding sites. Additional sequences that can be included in the expression vector include an origin of replication, and a selection marker gene to facilitate identification of transformants such as genes conferring resistance to antibiotics (e.g., the amp, kana, gpt, neo, and hyg genes).

Host cells suitable for use in the recombinant expression of the fusion protein include bacterial cells such as E. coli, and eukaryotic cells including but not limited to yeast, insect cells (e.g. SF9 cells), and mammalian cells such COS, CHO, HeLa cells and HEK293.

The expression vectors can be introduced into a host cell by well-known methods such as calcium chloride transformation for bacterial cells, and calcium phosphate treatment or electroporation for mammalian cells. Cells transformed by the expression vectors can be selected based on the phenotype provided by the selectable marker gene.

Once expressed, the recombinant fusion proteins can be purified according to standard methods available in the art, such as ammonium sulfate precipitation, affinity columns, chromatography, gel electrophoresis, among others. In one embodiment, the fusion protein is purified based on affinity chromatography using antibodies that bound to Protein M. In another embodiment, a purification tag is inserted at the N-terminus or the C-terminus of the fusion protein and is used for purification. The examples of such tags are: 6 His-tag, myc-tag, strep-tag and others.

In some embodiments, uses for Protein M fusion proteins include the following.

The present teachings include a pharmaceutical composition comprising: a Protein M fusion protein having an antibody-binding domain and an ACE2 cellular receptor (referred to as armY-ACE2), serum or plasma from the subject or from a compatible donor, and a pharmaceutically acceptable carrier or a pharmaceutically acceptable excipient, wherein the fusion protein acts to eradicate SARS-CoV and SARS-COV-2 coronaviruses in patients infected with the virus, wherein the fusion protein arms immunoglobulins to recognize and bind with high affinity to the S1 spike protein expressed by the SARS-CoV and SARS-COV-2 coronaviruses. In some embodiments, the Protein M fusion protein optionally comprises a linker, the antibody-binding domain comprises the Protein M protein from Mycoplasma sp., the antibody-binding domain comprises Protein M that binds with high affinity to the antibody Fab domain and blocks the antibody's antigen binding site; the antibody-binding domain comprises Protein M that does not bind to the antibody whose Fab domain binding site is engaged with its cognate antigen; the antigen-binding domain comprises a cellular receptor, ACE2, that binds with high affinity to the S1 spike protein expressed by the SARS-CoV and SARS-COV-2 coronaviruses; the antibody-binding domain comprises Protein M that binds with high affinity to the antibody Fab domain and blocks the antibody's cognate antigen binding site. In some preferred embodiments, immunoglobulins bound with the disclosed fusion proteins retain at least partially Fc-linked functional activities (effector functions), such as Fc-receptor binding and complement activation.

In some embodiments, Protein M fusion proteins comprise a linker between Protein M and receptor fragment. Non-limiting examples of such linkers include

(SEQ ID NO: 12) GGGGSGGGGSGGGGS, (SEQ ID NO: 13) GGGGSGGGGS or (SEQ ID NO: 14) GGGGS.

The present teachings also include a pharmaceutical composition comprising: a Protein M fusion protein having an antibody-binding domain and a fusion domain comprising a protein, peptide or chemical group able to bind a pathogen, a toxin, any biologic entity or a chemical group, serum or plasma from the subject or from a compatible donor, and a pharmaceutically acceptable carrier or a pharmaceutically acceptable excipient. In some embodiments, the antigen to be bound by the fusion protein comprises an antigen arising from a pathogen, a toxin, a subject, arising from a disease state within the subject, or arising from a disease related organism within the subject and the disease state within the subject is caused by a virus, bacteria, tumor, abnormal cell or by exposure to an external disease-causing agent, wherein the antigen-binding domain comprises one or more protein or peptide or chemical group (collectively referred to as molecules) chosen from the group consisting of: a soluble molecule, a soluble molecule bound to a matrix, an insoluble molecule bound to a matrix, an insoluble aggregate of molecules, a molecule comprising one or more epitopes, a nonviable cell-associated molecule, a nonviable organism-associated molecule, or a molecule conjugated with a liposome.

The present teachings also include a Protein M fusion protein having an antibody-binding domain and a fusion partner domain comprising a protein, peptide or chemical group, wherein the antibody-binding domain comprises Protein M that does not bind to the antibody whose Fab domain binding site is engaged with its cognate antigen. In some embodiments, the fusion partner domain may be an endogenous protein or peptide; the fusion partner domain may be an exogenous protein or peptide; the fusion partner domain may be an enzyme, wherein the enzyme is a reporter enzyme horseradish peroxidase fusion protein (HRP). Protein M-HRP may be used to detect immunoglobulins in solution or in a matrix, wherein the immunoglobulins detected are not engaged with their cognate antigen. Thus, Protein M-HRP may be used to identify or rule out false positive test results in antibody-based detection of antigen. The fusion partner domain may permit for a chemical modification, wherein the chemical modification is, for example, an addition of biotin by an enzymatic conjugation of a single biotin on a unique 15 amino acid peptide tag using the biotin ligase (BirA).

The present teachings also include a Protein M fusion protein having an antibody-binding domain and a fusion partner domain comprising a protein, peptide or chemical group, wherein the antibody-binding domain comprises Protein M that binds with high affinity to the antibody Fab domain and blocks the antibody's antigen binding site, wherein the fusion partner domain may be a cytokine, chemokine, hormone, growth factor, receptor, ligand, neurotransmitters or a synthesized molecule. In some embodiments, the fusion partner domain is made to increase or decrease the bioavailability of bound antibodies, or the fusion partner domain immunogenicity is increased or decreased when bound to antibodies; or the fusion partner domain is made to increase or decrease the immunogenicity of bound antibodies.

In some embodiments, Protein M fusion proteins arm free non-antigen bound immunoglobulin to bind a pathogen or toxin (both referred heretofore as “target”) with a high affinity. This is made possible through (a) Protein M component of the fusion protein that engages the immunoglobulin rendering it no longer able to bind its cognate antigen, and (b) the fused receptor or ligand, which is the same attachment receptor or ligand found on cells that the target uses to attach and gain entry. Protein M fusion protein-armed immunoglobulins (referred heretofore as “armY-fusion”) binding to their target is the initial step in the mechanism of target eradication. Once bound to target, armY-fusion will block the interaction between the target and the attachment receptor found on host cells, thereby, neutralizing the target and prevent it from infecting the cell. Whereas Protein M fusions serve to associate immunoglobulins with the target and neutralize the target, the immunoglobulin serves to mark the target for destruction and clearance by the innate immune system including cells that bear Fc receptors (e.g., macrophages) and complement factors.

Complement is part of the innate surveillance system involved in the first line of defense against pathogens. One mechanism to direct complement to a specific pathogen is via the classical complement pathway, which is initiated by antibodies that are bound to antigen. C1q recruitment to antibodies is an essential first step in the activation of the complement cascade. Antibody binding to antigen (found on the pathogen or in solution as an immune complex) induces a change in the antibody's three-dimensional structure that exposes a C1q binding site found within the CH2 portion of the antibody Fc region. Upon C1q binding and activation, additional complement factors are recruited resulting in the formation of other effector molecules such as C3b, the main effector of the complement system. These events culminate in the formation of the membrane attack complex (MAC) that forms holes or pores on the surface of pathogens including bacteria, viruses and cancer cells resulting in subsequent clearance. C3b also serves as a potent opsin able to tag pathogens, immune complexes (antigen-antibody), and apoptotic cells for phagocytosis by immune cells that express C3b receptors. Together, MAC and C3b serve to effectively eradicate pathogens targeted by antibodies that recruit C1q. C1q is composed of 18 polypeptide chains: six A-chains, six B-chains, and six C-chains. Each chain contains a collagen-like region located near the N terminus and a C-terminal globular region.

In some embodiments, Protein M fusion protein in complex with an antibody can engage C1q and activate classical complement pathway that would contribute to eradication of the pathogen or a cancer cell, to which the Protein M fusion protein is targeted. Normally, for C1 q to bind the antibody, the antibody must first bind its antigen (immobilized on a cell or pathogen or in solution as an immune complex), and then the antibody undergoes a conformational change that permits C1q binding. However, Protein M fusion protein-IgG complex can specifically recruit C1q as demonstrated, for example, in Example 10, FIG. 11 below. Thus, Protein M-fusions can be considered a tool to specifically induce a conformational change in antibodies (while in solution) resulting in its ability to engage C1q and activate the complement pathway.

EXAMPLES

Aspects of the present teachings may be further understood in light of the following examples, which should not be construed as limiting the scope of the present teachings in any way. Below, exemplary methods to develop and characterize Avi-/myc-tagged Protein M, myc-tagged Protein M-HRP fusion protein and Protein M-ACE2 fusion protein (aka, armY-ACE2) are disclosed. These and similar methods can be applied to generate and use different Protein M fusion proteins.

Example 1. Gene Construction of Protein M

Protein M (also referred to as armY) (SEQ ID NO:10 and 38) was constructed using the mature amino acid sequence of Protein M (37-556 amino acid) containing a myc-tag (EQKLISEEDLLRKR) and linker sequence (AANGGGGSGGGGS) and a mono-biotinylation sequence “Avi-Tag” (MAGGLNDIFEAQKIEWHEGG) at its N-terminal end. The linear amino acid sequence was reverse translated to its corresponding DNA sequence using the free GenSmart™ Codon Optimization Tool by GenScript for expression in human cells (gensmart-free-gene-codon-optimization). This sequence was submitted for gene synthesis and inserted into the plasmid cloning vector pUC57 (GenScript, Inc.). The insert was amplified and cloned into a previously constructed mammalian cell expression vector pcDNA3(−) containing a myc-tag-Protein M sequence by replacing the myc-tag-Protein M sequence with the above myc-tag sequence that included a mono-biotinylation sequence, producing a final Protein M construct (IL-2 leader sequence—biotinylation tag—myc tag—linker—Protein M). The plasmid expression vector construct was verified by restriction enzyme analysis, amplified in E. coli and purified using a maxiprep kit (GenScript Inc. and Eton Bioscience, Inc.).

Example 2A. Characterization of Protein M as an Antibody Neutralizer and Blocking Reagent Tool. Binding of Protein M to Immobilized Antibody (FIG. 3)

Protein M binding to plate bound antibody was demonstrated by measuring the amount of myc-tagged Protein M bound to the antibody coated on a 96-well plate by an ELISA-based method.

Briefly, 5 ug/ml of human IgG (Sigma) was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. After washing twice with PBS+Tween 20 (wash buffer, Pierce), Protein M in expression medium diluted in assay buffer (0.5% BSA in PBS+Tween 20) or assay buffer was added to antibody-coated wells in duplicate. After approximately 30 minutes at room temperature, the wells were washed and mouse IgG1 anti-myc antibody (clone: 9E10) in assay buffer was added to detect the myc-tagged Protein M. After approximately 30 minutes, the wells were washed 3×s and anti-mouse IgG labeled with HRP was added to the wells. After approximately 30 minutes, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 2B. Characterization of Protein M as an Antibody Neutralizer and Blocking Reagent Tool. Protein M Neutralizes/Blocks Antibody Binding to Cognate Antigen (FIG. 4)

The ability of Protein M to block binding to its cognate antigen was demonstrated by measuring the amount of unblocked, free antibody bound to its antigen coated on a 96-well plate by an ELISA-based method.

Briefly, 5 ug/ml of human IgG (Sigma) was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. RP-labeled goat anti-human IgG antibody (GenScript, Inc.) or a RP-labeled chicken anti-human IgG antibody (Aves Labs, Inc.) was added to Protein M in expression medium or to expression medium alone and allowed to form complexes at room temperature for approximately 2 hours. After washing twice with PBS+Tween 20 (wash buffer, Pierce), samples were added to human IgG coated wells in duplicate. After approximately 45 minutes at room temperature, the wells were washed 3×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 3. PROTEIN M-HRP Fusion Protein (SEQ ID NO:11 and 39). Gene Construction of Protein M-HRP

The mature amino acid sequence of horseradish peroxidase HRP (31-338 amino acid) was generated containing a myc-tag (EQKLISEEDL) and linker (AAN) sequence at its N-terminal end. The amino acid sequence encoding 3 sets of 4 glycine residues and 1 serine residue (e.g., GGGGS)3 linker followed by the mature amino acid sequence of Protein M (37-556 amino acid) was added to its C-terminal end producing a final Protein M-HRP construct containing (IL-2 leader sequence—myc tag—HRP—linker—Protein M). The linear amino acid sequence was reverse translated to its corresponding DNA sequence using the free GenSmart™ Codon Optimization Tool by GenScript for expression in human cells (gensmart-free-gene-codon-optimization). This sequence was submitted for gene synthesis and inserted into the plasmid cloning vector pUC57 (GenScript USA Inc.). The insert was amplified and cloned into a mammalian cell expression vector, pcDNA3(−). The plasmid expression vector construct was verified by restriction enzyme analysis, amplified in E. coli and purified using a maxiprep kit (GenScript Inc. and Eton Bioscience, Inc.).

Example 4. Characterization of Protein M-HRP as a Novel Antibody Detection Reagent Tool. Detection of Immobilized F(Ab′)₂, Antibody or Antibody in Solution by Protein M-HRP Fusion Protein

Protein M-HRP direct detection of plate bound F(ab′)₂, antibody or indirect detection of antibody in solution was demonstrated by measuring the amount of Protein M-HRP bound to antibody coated on a 96-well plate by an ELISA-based method (FIG. 5). Direct detection of antibody by Protein M-HRP fusion protein: Briefly, 2 ug/ml of goat F(ab′)₂ or 5 ug/ml of human IgG (Sigma) or 2 ug/ml of mouse IgG1 isotype anti-myc or anti-human CD28 antibodies was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. After washing twice with PBS+Tween 20 (wash buffer, Pierce), Protein M-HRP in expression medium or expression medium diluted in assay buffer (0.5% BSA in PBS+Tween 20) was added to F(ab′)₂ or antibody-coated wells in duplicate. After approximately 30 minutes at room temperature, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software). Indirect detection of antibody in solution by Protein M-HRP fusion protein: Briefly, Protein M-HRP was incubated with varying amounts of mouse IgG1 antibody in assay buffer and allowed to form complexes at room temperature for approximately 30 minutes. Protein M-HRP alone was also included as a positive control. After washing twice with PBS+Tween 20 (wash buffer, Pierce), samples were added to human IgG-coated wells in duplicate. After approximately 30 minutes at room temperature, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 5. Absence of Detection of Antibody Bound to an Immobilized Antigen by Protein M-HRP Fusion Protein

Protein M does not bind to antibodies already bound to antigen. The absence of detection of antibody bound to an immobilized antigen by Protein M-HRP fusion protein was demonstrated by measuring the amount of Protein M-HRP bound to the antibody engaged with its antigen on a 96-well plate by an ELISA-based method (FIG. 6).

Briefly, 1 ug/ml of a myc-tagged protein or 2 ug/ml of human ASIP (agouti-signaling protein, RnD Systems) was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. After washing twice with PBS+Tween 20 (wash buffer, Pierce), mouse IgG1 anti-myc antibody (clone: 9E10) in assay buffer, rabbit anti-ASIP antibody (Thermofisher) or assay buffer alone was added to the myc-tagged protein or ASIP, coated wells, respectively, in duplicates. After approximately 60 minutes, the wells were washed 3×s and Protein M-HRP was added. To show that mouse anti-myc and rabbit anti-ASIP bound to myc-tagged protein or ASIP coated wells, anti-mouse IgG labeled with HRP or biotinylated anti-rabbit IgG+SA-RP was added to another set of coated wells, respectively. After approximately 30 minutes, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 6. PROTEIN M-ACE2 Fusion Protein (Referred to as armY-ACE2) (SEQ ID NO:9 and 37). Gene Construction of armY-ACE2 (Protein M Fused to Angiotensin I-Converting Enzyme 2 (ACE2))

The mature amino acid sequence of human ACE2 (18-740 amino acid) was generated containing a myc-tag (EQKLISEEDLLRKR) and linker (GSPGGA) sequence at its N-terminal end. The linear amino acid sequence was reverse translated to its corresponding DNA sequence using the free GenSmart™ Codon Optimization Tool by GenScript for expression in human cells (gensmart-free-gene-codon-optimization). This sequence was submitted for gene synthesis and inserted into the plasmid cloning vector pUC57. The insert was amplified and cloned into the mammalian cell expression vector pcDNA3(−) containing the myc-tag-Protein M-HRP sequence (see above) by replacing the myc-tag-HRP sequence with the above myc-tag-ACE2 sequence, upstream of the sequence encoding 3 sets of 4 glycine residues and 1 serine residue (e.g., GGGGS)3 linker followed by the mature amino acid sequence of Protein M (37-556 amino acid), producing a final armY-ACE2 construct containing (IL-2 leader sequence—myc tag—ACE2—linker—Protein M). The plasmid expression vector construct was verified by restriction enzyme analysis, amplified in E. coli and purified using a maxiprep kit (GenScript Inc. and Eton Bioscience, Inc.).

Example 7A. Protein M, Protein M-HRP and armY-ACE2 Gene Expression

The human 293T kidney cell line was transfected with the expression vector encoding the Protein M, Protein M-HRP or armY-ACE2 (Protein M-ACE2) fusion protein sequences, by calcium phosphate transfection method. After 7-16 hours, the transfection solution was replaced with protein expression medium and the supernatant harvested after approximately 48 hours. To purify the proteins, the supernatant was harvested and pass through an anti-myc antibody-coupled agarose resin and the captured proteins eluted using 0.1M Glycine pH 2.5 and neutralized by 1M Tris-HCl pH 8.0. The eluted proteins were dialyzed against a phosphate buffered saline solution and stored in 4° C.

Example 7B. Characterization of armY-ACE2 as a Novel Therapeutic in the Treatment of Coronavirus Infection. Targeting of [armY-ACE2+Antibody] to SARS-CoV-2 Spike Protein and Binding of [armY-ACE2+SARS-CoV-2 Spike Protein] Complex to Antibody

Complex [armY-ACE2+antibody] targeting of SARS-CoV-2 spike protein was demonstrated by measuring the amount of [armY-ACE2+antibody] complexes bound to the SARS-CoV-2 spike protein coated on a 96-well plate by an ELISA-based method. Binding of [armY-ACE2+ to SARS-CoV-2 spike protein] complexes to immobilized antibody was demonstrated by measuring the amount of [armY-ACE2+SARS-CoV-2 spike protein] complexes bound to the antibody coated on a 96-well plate by an ELISA-based method (FIG. 7).

Briefly, 50 ul of 1 ug/ml histidine (his)-tagged SARS-CoV-2 spike protein (GenScript, Inc.) or 5 ug/ml of human IgG (Sigma) was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (phosphate buffered saline pH 7.4) (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. [armY-ACE2+antibody] complexes were allowed to form at room temperature by adding 0.25 ug/ml biotinylated goat IgG (Jackson ImmunoResearch Inc.) to armY-ACE2 in expression medium for 60 minutes. Biotinylated antibody was also added to Protein M (lacking ACE2 domain) or expression medium as negative controls. armY-ACE2 alone in expression medium was prepared as an additional negative control. The samples were diluted in assay buffer (0.5% BSA in PBS+Tween 20) and added to SARS-CoV-2 spike protein coated wells, washed twice with PBS+Tween 20 (wash buffer, Pierce), in duplicate. After approximately 30 minutes at room temperature, the wells were washed 3×s and streptavidin-horseradish peroxidase (SA-HRP) (Biolegend, Inc.) in assay buffer was added to the wells and allowed to incubate at room temperature for approximately 20 minutes. After four washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software). Binding of [armY-ACE2+ to SARS-CoV-2 spike protein] complexes to immobilized human IgG: armY-ACE2+SARS-CoV-2 spike protein complexes were allowed to form at room temperature by adding 2 ug/ml of SARS-CoV-2 spike protein to armY-ACE2 in expression medium. armY-ACE2 alone in expression medium was prepared as a negative control. After approximately 60 minutes at room temperature, the wells were washed and mouse IgG1 anti-histidine tag (GenScript, Inc.) or mouse IgG1 anti-myc antibody (clone: 9E10) in assay buffer was added to detect the histidine-tagged SARS-CoV-2 spike protein or myc-tagged armY-ACE2 bound to human IgG coated on the well, respectively. After approximately 30 minutes, the wells were washed 3×s and anti-mouse IgG labeled with HRP was added to the wells. After approximately 30 minutes, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 8. ArmY-ACE2 Binding to Immobilized Antibody or Antibody in Solution

armY-ACE2 binding to immobilized antibody or antibody in solution was demonstrated by measuring the amount of free or antibody-bound armY-ACE2 in an ELISA based method (FIG. 8).

Briefly, 5 ug/ml of human IgG (Sigma) was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. Binding of antibody in solution by armY-ACE2: Briefly, armY-ACE2 was incubated with purified human IgG, 2% human serum (containing antibodies) or PBS in assay buffer and allowed to form complexes at room temperature for approximately 2 hours. After washing twice with PBS+Tween 20 (wash buffer, Pierce), samples were added to human IgG-coated wells in duplicate. After approximately 60 minutes at room temperature, the wells were washed and mouse IgG1 anti-myc antibody (clone: 9E10) in assay buffer was added to detect the myc-tagged armY-ACE2 bound to human IgG coated on the well. After approximately 30 minutes, the wells were washed 3×s and anti-mouse IgG labeled with HRP was added to the wells. After approximately 30 minutes, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 9. [armY-ACE2+Antibody] Complexes Engage Fc Receptors on K562 Erythroleukemic Cell Line

Binding of antibodies to Fc-receptor expressed on cells (e.g., innate immune cells, antigen presenting cells) requires interaction with the antibody Fc region. [armY-ACE2+antibody] complex engagement of Fc receptors was demonstrated by measuring the amount of [armY-ACE2+antibody] complexes bound to the human FcγRII (CD32) expressed on K562, a human erythroleukemic cell line, by flow cytometry (FIG. 9).

Briefly, K562 cells were taken from cell culture medium and centrifuged (3000 rpm for 3 minutes) and supernatant removed by vacuum aspiration. After a wash with chilled FACS buffer (0.5% BSA in PBS+0.1% sodium azide), 100,000 cells was transferred to 1.5 ml microcentrifuge tubes in FACS buffer and the supernatant removed after centrifugation and the cells kept on ice. 5 ug/ml of human IgG (Sigma Aldrich) was added to armY-ACE2 in expression medium and kept at room temperature for approximately 30 minutes to form complexes, and tubes transferred to ice to chill. armY-ACE2 alone in expression medium was also prepared as a negative control. 100 ul of [army-ACE2+antibody] complexes or armY-ACE2 alone was added to K562 cells and allowed to incubate on ice for approximately 30 minutes. After two washes in FACS buffer, anti-myc (clone 9E10 mouse antibody) was added to detect the myc-tagged army-ACE2 and allowed to incubate for approximately 20 minutes. After two washes, anti-mouse IgG-Alexafluor-488 (Biolegend, Inc.) was added to detect anti-myc antibody and allowed to incubate for approximately 20 minutes. After two washes, cells were resuspended in FACS buffer and analyzed by flow cytometry (BD FACS Calibur and CellQuest Pro analysis software). At least 5,000 events were acquired per sample. Cells incubated with negative controls as described above served as source of background basal percent value. The percentage of cells staining positive for [army-ACE2+antibody] complexes was determined by the percentage of cells present within a gate established such that <6% of the positive events of cells incubated with negative control samples measured represented background fluorescence.

To demonstrate that binding of [army-ACE2+antibody] complexes to K562 was through a specific interaction with Fc-receptors expressed on the cells, K562 cells were pre-incubated with FcγRII blocking anti-CD32 (clone IV.3, mouse IgG2b, kappa) (FIG. 10).

Briefly, 1 ug of anti-CD32 or an isotype-matched mouse IgG2b, kappa control antibody was added to 100 ul of FACS buffer and added to approximately 100,000 K562 cells and placed on ice for approximately 15 minutes. After 2 washes, 100 ul of [army-ACE2+antibody] complexes prepared as described above was added to the cells and kept on ice for approximately 20 minutes. After two washes, fluorescein (FITC)-labeled anti-myc (Biotium, Inc.) was added to K562 cells and allowed to incubate on ice for approximately 15 minutes. After two washes, cells were resuspended in FACS buffer and analyzed by flow cytometry (BD FACS Calibur and CellQuest Pro analysis software). At least 5,000 events were acquired per sample. Cells incubated with negative control served as source of background basal percent value. The percentage of cells staining positive for [army-ACE2+antibody] complexes was determined by the percentage of cells present within a gate established such that <2% of the positive events of cells incubated with negative control samples measured represented background fluorescence.

Example 10. [armY-ACE2+Antibody] Complex Binding to Purified Human C1q Complement Component

The binding of the C1q complement component to antibody is the initial step towards the activation of the classical complement pathway. The [armY-ACE2+antibody] complex binding to C1q complement component was demonstrated by measuring the amount of [armY-ACE2+antibody] complexes bound to the purified C1q coated on a 96-well plate by an ELISA-based method (FIG. 11).

Briefly, 50 ul of 5 ug/ml purified human C1q (>95% pure by SDS-PAGE analysis, Complement Technology, Inc.) was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. armY-ACE2+antibody complexes were allowed to form at room temperature by adding 5 ug/ml FITC-labeled mouse IgM (Biolegend, Inc. Cat #401607) or 10 ug/ml FITC-labeled mouse IgG1 (Biolegend, Inc. Cat #200305) to armY-ACE2 or Protein M (lacking ACE2 domain) in expression medium. FITC-labeled antibody or armY-ACE2 added to expression medium served as negative controls. To block binding of [armY-ACE2+antibody] complex to immobilized C1q coated on the well, 10 ug/ml of soluble C1q was added to the [armY-ACE2+antibody] complexes and allowed to incubate at room temperature for 30 minutes. C1q-coated wells were washed twice with PBS+Tween 20 (wash buffer, Pierce), and the samples were added in duplicate. After approximately 30 minutes at room temperature, the wells were washed 3×s and biotinylated anti-FITC (Biolegend, Inc.) in assay buffer was added to the wells and allowed to incubate at room temperature for approximately 45 minutes. The wells were washed 3×s and SA-HRP (Biolegend, Inc.) in assay buffer was added to the wells and allowed to incubate at room temperature for approximately 25 minutes. After 4 washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 11. armY-ACE2 or [armY-ACE2+Antibody] Complexes Exhibit ACE2 Activity

ACE2 activity in armY-ACE2 or [armY-ACE2+antibody] complexes was demonstrated by measuring the fluorescence emitted after cleavage of the ACE2 fluorogenic substrate MCA-APK-(Dnp). ACE2-dependent removal of the quenching Dnp group induces fluorescence, which is measured by a fluorescence plate reader (FIG. 12).

Briefly, armY-ACE2 or [armY-ACE2+antibody] complexes were diluted in ACE2 buffer (1 mol/L NaCl, 75 mmol/L Tris HCl, pH 7.5, and 50 μmol/L ZnCl2) and 30 μl of diluted samples were combined with 170 μl the ACE2 fluorogenic substrate MCA-APK(Dnp) (AnaSpec, Inc. Cat #AS-60757) in ACE2 buffer. The final concentration of ACE2 substrate was 20 μM in a final volume of 200 μl. The samples were kept in the dark for 16 hours at room temperature. 100 μl of samples were transferred to a flat bottom NUNC Black 96 Microwell strip plate and fluorescence measured using a fluorescence plate reader (Cytofluor 4000, Gain 75, Ex 360/40, Em 460/40).

Example 12. SDS-PAGE Analysis of Purified armY-ACE2

SDS-PAGE analysis of purified armY-ACE2 was performed under non-reducing and reducing conditions and showed the expected band of ˜180 kDa (theoretical molecular weight: 150 kDa) (FIG. 13).

Briefly, 8 ul of sample buffer (Invitrogen) was added to 24 ul of eluted fractions and mixed. The sample were heated in 80° C. water bath for 10 minutes. Reducing agent (10×, Invitrogen) was added to some of the tubes containing the samples and mixed. Non-reduced and reduced samples were loaded onto a 4-12% NuPAGE pre-cast SDS-PAGE gel and separated at 175V for 30 minutes in MES-SDS running buffer (Invitrogen). PageRuler unstained protein ladder (10-200 kDa, Invitrogen) was also included. After electrophoresis, the gel was rinsed in distilled water and the protein bands stained using SimplyBlue Safe Stain (Invitrogen) and the gel photographed.

Example 13. Evaluating Neutralization of SARS-CoV-2 by armY-ACE2 In-Vitro

Live SARS-CoV-2 virus has to be handled under biosafety level 3 conditions due to its high pathogenicity and infectivity and the lack of effective vaccines and therapeutics. Recently, a VSV pseudovirus production system, a pseudovirus-based neutralization assay has been developed for evaluating neutralizing antibodies against SARS-CoV-2 in biosafety level 2 facilities (Nie et.al., 2020). Pseudoviruses are useful tools because of their safety and versatility, especially for emerging and re-emerging viruses. This example utilizes a validated Pseudovirus neutralization protocol slightly modified from Nie et.al., to test the efficacy of armY-ACE2 by measuring the ability of armY-ACE2 to inhibit SARS-CoV-2 pseudovirus binding and infection of ACE2 expressing cells.

Briefly, the vesicular stomatitis virus (VSV) pseudovirus system (G*AG-VSV) is used, which packages expression cassettes for firefly luciferase instead of VSV-G in the VSV genome. The SARS-CoV-2 pseudovirus is produced by transfecting human 293T cells with the expression plasmid pcDNA3.1 containing the codon-optimized SARS-CoV-2 spike protein sequence, followed by infection with G*AG-VSV pseudovirus. Post infection, SARS-CoV-2 pseudoviruses is harvested and stored until use.

Huh7 human hepatocellular cell line naturally express the human ACE2 receptor protein and is an ideal cell line for SARS-CoV-2 pseudovirus infection as it demonstrates high luciferase activity upon infection. Viral inocula of approximately 650 TCID50 (the 50% tissue culture infectious dose of SARS-CoV-2 pseudovirus) is used for the assay.

Neutralization of SARS-CoV-2 pseudovirus infection of Huh7 is confirmed by the reduction in luciferase gene expression upon infection. Neutralization condition: SARS-CoV-2 pseudovirus is incubated with serial dilutions of armY-ACE2+human plasma containing immunoglobulins (six, 1:3 dilutions, or half-log dilutions) in duplicate. Human plasma added Protein M or human plasma alone are included as negative controls. Recombinant ACE2-Ig fusion protein (commercially available from GenScript Inc., catalog #Z03484) has been demonstrated to neutralize SARS-CoV-2 pseudovirus infection previously (Lei et.al., 2020) and is used in this assay as a positive control. After incubation for 1 hour at 37° C. in a 96-well plate format, 5×10{circumflex over ( )}4 Huh7 cells is added to each well. After 24 hours of incubation in a 5% CO2 chamber at 37° C., luminescence is measured by adding luciferase substrate and the luminescence measured using a 96-well plate luminescence plate reader. Upon subtraction of background luminescence, relative light units (RLU) versus the concentration of test sample and controls is plotted to generate an inhibitory dose response curve from which the IC50 is calculated. Human plasma added armY-ACE2 is neutralizing SARS-CoV-2 pseudovirus infection of Huh7 in a dose-dependent fashion. Human plasma added Protein M or human plasma alone is not neutralizing SARS-CoV-2 pseudovirus infection in this assay.

Example 14. Evaluating Eradication of SARS-CoV-2 by armY-ACE2 In Vivo

While the Example 13 evaluates the efficacy of armY-ACE2 engaged immunoglobulins to neutralize SARS-CoV-2 in vitro, this Example will demonstrate the efficacy of armY-ACE2 to promote eradication of SARS-CoV-2 in vivo, thereby protecting the animal from a severe clinical disease and succumbing to a lethal infection.

Protein M binds to immunoglobulin of various species including those of man and mice. Commercially available human ACE2 transgenic mice K18-hACE2 (The Jackson Laboratory, Stock #034860) develops severe clinical disease upon infection with SARS-CoV (McCray et.al., 2007) to a similar degree observed in patients with severe Covid-19. According to CDC, “Among patients who developed severe disease, the median time to dyspnea from the onset of illness or symptoms ranged from 5 to 8 days, the median time to acute respiratory distress syndrome (ARDS) from the onset of illness or symptoms ranged from 8 to 12 days, and the median time to ICU admission from the onset of illness or symptoms ranged from 10 to 12 days.”

According to JAX laboratory, “These K18-hACE2 mice develop a rapidly lethal infection after intranasal inoculation with a human strain of SARS-CoV. Infection begins in airway epithelia, with subsequent alveolar involvement and extrapulmonary virus spread to the brain. Infection results in macrophage and lymphocyte infiltration in the lungs and upregulation of proinflammatory cytokines and chemokines in both the lung and the brain. By days 3 to 5 postinfection, K18-hACE2 mice begin to lose weight and become lethargic with labored breathing.” K18-hACE2 mice become moribund 4 days after inoculation, and all mice are dead 7 days after inoculation.

Recently, it was determined that K18-hACE2 mice “present with more symptomatic disease than other hACE2 mouse models of SARS-CoV-2 infection.” (Moreau et.al., 2020) For this reason and because the k18-hACE2 mice are readily commercially available, we employed the k18-hACE2 SARS-CoV model to evaluate the efficacy of army-ACE2 engaged immunoglobulins to eradicate SARS-CoV or SARS-CoV-2 in vivo following the methods as described (McCray et.al., 2007) with slight modifications.

Infection of K18-hACE2 mice with SARS-CoV or SARS-CoV-2. SARS-CoV and SARS-CoV-2 strains is obtained from the Centers for Disease Control, Atlanta, Ga. The virus is propagated and titered on Vero E6 cells in a biosafety level 3 laboratory and the virus titer is determined by a plaque assay.

Mice are lightly anesthetized with isoflurane and infected intranasally with the indicated dosage of SARS-CoV or SARS-CoV-2 in 30 ul of Dulbecco's modified Eagle medium. Infected mice are examined, weighed and evaluated for severe clinical disease on a daily basis monitoring for appearances of lethargy, labored breathing, moribund and death.

Treatment with armY-ACE2. Plasma from mice of the same background (C57BL/6J×SJL/J) as K18-hACE2 mice is harvested and mixed with armY-ACE2 and allowed to incubate at 37° C. for 1-2 hours to permit arming of plasma immunoglobulins. Infected mice (n=6) receive daily injections of 0.2 ml of armY-ACE2 plasma beginning one day after infection for 7 days. Two other cohorts of mice receive Protein M+plasma or plasma alone and serve as negative control treatment groups.

To obtain specimens for virus titers, a few animals are sacrificed before injection and after 1, 2, 3, 4, 5 and 6 days after infection, and organs are aseptically removed into sterile phosphate-buffered saline. In some cases, blood is obtained via catheterization of the inferior vena cava. Tissues are homogenized using a manual homogenizer, and the 50% tissue culture infective dose (TCID) is determined as described previously (Subbarao et.al., 2004) to determine the amount of virus per gram of tissue. Mice treated with army-ACE2 plasma do not succumb to infection, whereas mice in the negative control groups succumb. Surviving mice are permitted to continue in the study over an additional 2 months. These mice developed immunity to the virus and are protected from a subsequent challenge with the virus. Surviving mice are re-infected and examined, weighed daily and evaluated.

At termination, whole-lung lavage is performed and the lavage is evaluated for cellular and biochemical changes using standard techniques. Significantly lower cellular infiltrates and inflammatory markers in armY-ACE2 plasma treated mice are found as compared to mice in the negative control groups. Lungs and other organs are examined by histology and immunohistochemistry to evaluate the degree of disease pathology and detect viral antigen. Significantly lower severe disease pathology and viral presence in the lungs and organs or armY-ACE2 plasma treated mice are found as compared to mice in the negative control groups, indicating effective viral clearance and eradication. Similar findings are observed in armY-ACE2 plasma treated mice that had developed immunity to the virus and re-challenged with the virus.

Extraction of total RNA and quantitative reverse transcription-PCR (RTPCR) are performed to measure levels of viral RNA in various tissue specimen. An aliquot of cDNA is subjected to PCR using a MyiQ single-color real-time PCR detection system with iQ SYBR green Supermix. A set of primers is used for the SARS-CoV or SARS-CoV-2 nucleocapsid (N) gene or a house-keeping gene. Significantly lower viral genes in specimens acquired from armY-ACE2 plasma treated mice are found as compared to mice in the negative control groups, indicating effective viral clearance and eradication. Similar findings are found in armY-ACE2 plasma treated mice that had developed immunity to the virus and re-challenged with the virus.

Example 15. Identification of Two Potential Immunogenic Peptide Regions in Protein M (a.a. 469-556)

The online B-cell epitope prediction tools (IEDB Analysis Resource) were used to determine potential immunogenic peptide regions in Protein M (a.a. 469-556). Using six online prediction tools [Bepipred Linear Epitope Prediction 2.0, Bepipred Linear Epitope Prediction, Chou & Fasman Beta-Turn Prediction, Emini Surface Accessibility Prediction, Karplus & Schulz Flexibility Prediction and Parker Hydrophilicity Prediction] two peptide regions in protein M c-terminal end (469-556 amino acid) were determined to be potentially immunogenic. The following peptide substitutions are proposed for these two regions to mitigate immunogenicity of Protein M C-terminal end (469-556 amino acid), which are listed below in a) an b), and additionally shown in the following Table 1. Complete Protein M amino acid sequences with substitutions are presented as SEQ ID NO: 63-66.

a) 494-507 amino acid of protein M (based on SEQ ID NO: 1) 1. QALANATASALAAM 2. AKLANATASALARM 3. QALEADADSALEAM 4. AKLANDTASSAERA b) 527-540 amino acid of protein M (based on SEQ ID NO: 1) 1. AIAGVASATNAVAS 2. AIAGVASATNAVKS 3. DIAGVSADTAEVAS 4. AITGASSATNAVKA

TABLE 1 Proposed alanine substitutions for Protein M c-terminal end (469-556 amino acid) to mitigate immunogenicity. a) 494 495 496 497 498 499 500 501 502 503 504 505 506 507 Original Q K L E N D T D S S L E R M Subs #1 Q A L A N A T A S A L A A M Subs #2 A K L A N A T A S A L A R M Subs #3 Q A L E A D A D S A L E A M Subs #4 A K L A N D T A S S A E R A b) 527 528 529 530 531 532 533 534 535 536 537 538 539 540 Original D I T G V S S D T N E V K S Subs #1 A I A G V A S A T N A V A S Subs #2 A I A G V A S A T N A V K S Subs #3 D I A G V S A D T A E V A S Subs #4 A I T G A S S A T N A V K A

Example 16. Non-Immune Human Serum or Plasma Antibodies Armed with armY-ACE2 Bind to SARS-CoV-2 Spike Protein (FIG. 14A-C)

Binding to SARS-CoV-2 spike protein by armY-ACE2 armed non-immune serum antibodies was demonstrated by measuring the amount of armed antibodies that bind the SARS-CoV-2 spike protein coated on a 96-well plate by an ELISA-based method.

Briefly, 50 ul of 5 ug/ml SARS-CoV-2 spike protein was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed 2×s with PBS (phosphate buffered saline pH 7.4) (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. [armY-ACE2+antibody] complexes were allowed to form by mixing armY-ACE2 with non-immune serum (pre-vaccine) diluted 1:200 in assay medium for 60 minutes in a 37° C. incubator. Mixtures containing pre-vaccine serum diluted 1:200 in assay medium, post-vaccine (Moderna SARS-CoV-2 spike mRNA vaccine) serum diluted 1:200 in assay medium or armY-ACE2 in assay medium, were included as controls and placed in a 37° C. incubator for 60 minutes. The final concentration of armY-ACE2 was 20 ug/ml. Binding of armY-ACE2 armed non-immune plasma (ACD-A) antibodies to SARS-CoV-2 spike protein was demonstrated following the same procedure described above.

After the incubation period, the samples were added to SARS-CoV-2 spike protein coated wells that had been washed 2×s with PBS+Tween 20 (wash buffer, Pierce), in duplicate. After approximately 120 minutes at room temperature, the wells were washed 4×s and anti-human IgG labeled with HRP (Genscript) was added to the wells and allowed to incubate at room temperature for approximately 25 minutes. After four washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and the blue color absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software) and a photo taken with a digital camera.

After incubating serum- or plasma-antibodies with armY-ACE2, the amount of free armY-ACE2 was determined by measuring the amount of unengaged armY-ACE2 that can bind to immobilized human IgG. Briefly, serum- or plasma samples incubated with armY-ACE2 as described above, were added to human IgG coated wells, in duplicate. 1 ug/ml of armY-ACE2 alone in assay buffer was added to separate wells as a reference positive control. After approximately 100 minutes at room temperature, the wells were washed and mouse IgG1 anti-myc antibody (clone: 9E10) in assay buffer was added to detect the myc-tagged armY-ACE2 bound to human IgG coated on the wells. After approximately 30 minutes, the wells were washed 3×s and anti-mouse IgG labeled with HRP was added to the wells. After approximately 20 minutes, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 17. Monoclonal Antibody (mAb) Armed with armY-ACE2 Gains the Ability to Bind the SARS-CoV-2 Spike Protein, but is No Longer Able to Bind its Natural Target Antigen (FIG. 15 A-B)

Binding to SARS-CoV-2 spike protein by armY-ACE2 armed mAb (originally anti-selectin) was demonstrated by measuring the amount of armed mAbs that bind to the SARS-CoV-2 spike protein coated on a 96-well plate by an ELISA-based method.

Briefly, 50 ul of 5 ug/ml SARS-CoV-2 spike protein was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed 2×s with PBS (phosphate buffered saline pH 7.4) (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. [armY-ACE2+mAb] complexes were allowed to form by mixing armY-ACE2 with the mAb in assay medium for 120 minutes in a 37° C. incubator. Mixtures containing mAb in assay medium, armY-ACE2 in assay medium, or assay medium alone were included as controls and placed in a 37° C. incubator for 120 minutes. The final concentration of mAb and armY-ACE2 were 1 ug/ml and 30 ug/ml, respectively.

After the incubation period, the samples were added to SARS-CoV-2 spike protein coated wells, in duplicate, that had been washed 2×s with PBS+Tween 20 (wash buffer, Pierce). After approximately 120 minutes at room temperature, the wells were washed 3×s and anti-human IgG labeled with HRP (Southern Biotech) was added to the wells and allowed to incubate at room temperature for approximately 45 minutes. After four washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

The inability of armY-ACE2 armed mAb (originally anti-selectin) to bind to its natural antigen was demonstrated by measuring the amount of armed mAb that bind to selectin protein coated on a 96-well plate by an ELISA-based method.

Briefly, 100 ul of 2 ug/ml biotinylated selectin protein was prepared in PBS and added to a flat bottom 96-well plate (streptavidin coated wells) after 2 washes with PBS+Tween 20 (wash buffer, Pierce).

[armY-ACE2+mAb] complexes were allowed to form by mixing armY-ACE2 with the mAb in assay medium for 120 minutes in a 37° C. incubator. Mixtures containing mAb in assay medium or assay medium alone were included as controls and placed in a 37° C. incubator for 120 minutes. The final concentration of mAb and armY-ACE2 were 63 ng/ml and 15 ug/ml (50× molar excess) or 7.5 ug/ml (25× molar excess), respectively.

After 2 hours, the selectin-coated wells were washed 2×s with wash buffer and the mixtures added in duplicate wells and allowed to incubate for 1 hour at room temperature. After the incubation period, the wells were washed 3×s. After approximately 60 minutes at room temperature, the wells were washed 3×s and anti-human IgG labeled with HRP (Southern Biotech)+2% mouse serum was added to the wells and allowed to incubate at room temperature for approximately 60 minutes. After four washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 18. Biotinylated Protein M Detects the Light-Chain of Antibody, but not the Heavy-Chain on Western Blot (FIG. 16)

Protein M containing an N-terminal biotinylation “Avi-Tag” sequence was mono-biotinylated using the Accelagen TurboBiotinylation kit following the reaction protocol (Accelagen, TurboBiotinylation-protocol). The use of mono-biotinylated protein M fusion as an immunologic research tool for detection of antibody light-chain was demonstrated using a 1D gel electrophoresis Western blot method.

Briefly, the antibody sample was diluted to 1.0 mg/mL with sodium dodecyl sulfate (SDS) boiling buffer and heated to 95° C. for 10 minutes, and further diluted to 0.01 mg/mL. The E. coli (K12 MG1655) lysate sample was diluted to 2.5 mg/mL in SDS boiling buffer. SDS slab gel electrophoresis was carried out under reducing conditions according to the method of Laemmli, U. (Nature 227: 680-685, 1970) as modified by O'Farrell (J Biol. Chem. 250: 4007-4021). The samples were loaded in wells in 10% acrylamide slab gels (0.75 mm thick). SDS slab gel electrophoresis was carried out for about 4 hours at 15 mA/gel. The following proteins (Millipore Sigma) were used as molecular weight standards: myosin (220,000), phosphorylase A (94,000), catalase (60,000), actin (43,000), carbonic anhydrase (29,000), and lysozyme (14,000, not shown). After slab gel electrophoresis, the gel for blotting was placed in transfer buffer (10 mM CAPS, pH 11.0, 10% methanol) and transblotted onto PVDF membranes overnight at 145 mA and approximately 100 volts/two gels. The blots were stained with Coomassie Brilliant Blue R-250, cut into pieces at the dark lines and flatbed scanned (not shown).

Western Blot analysis. The membrane sections were destained in 100% MeOH and rinsed briefly in Tween-20 tris buffer saline (TTBS). The blot was blocked for two hours in Superblock with 0.05% Tween-20 (Superblock-T). The blot was then incubated overnight in Superblock-T and rinsed 3×10 minutes in TTBS. The blot was then placed in mono-biotinylated protein M diluted to 1.0 μg/ml in Superblock-T for two hours and rinsed as above. The blot was then placed in poly-HRP streptavidin (ThermoFisher, Cat #N200) diluted 1:500,000 in Superblock-T for two hours, rinsed as above, treated with ThermoFisher Pierce ECL, and exposed to x-ray film for 3 minutes.

Example 19. Protein M Fusion Serves as a Surrogate Antigen and May be Used to Confirm that a) Antigen Binding is Via the Fab Domain of the Antibody and b) the Antibody's Target Antigen and/or Specificity (FIG. 17)

Binding of antibody to antigen is mediated by the Fab arm of the antibody, which contains the variable region where the antigen binding site is found. Protein M binds specifically to the light-chain variable region in the Fab and blocks the antigen binding site. Therefore, protein M may serve as a surrogate antigen and as an immunologic research tool, and used to a) confirm that the antibody binds via its Fab domain and b) confirms its specificity as it loses its antigen binding ability when bound by protein M. A 96-well ELISA-based method was used to demonstrate such protein M uses.

Protein M fusion serves as a surrogate antigen.

Briefly, 100 ul of 2 ug/ml mono-biotinylated protein M was prepared in PBS and added to a flat bottom 96-well plate (streptavidin coated wells) after 2 washes with PBS+Tween 20 (wash buffer, Pierce).

After approximate 2 hours at room temperature, the coated wells were washed 2×s with wash buffer and the varying amounts of monoclonal antibody (originally anti-selectin) were added in duplicate wells and allowed to incubate for approximately 2 hours at room temperature. After the incubation period, the wells were washed 3×s and anti-human IgG labeled with HRP (Southern Biotech) was added to the wells and allowed to incubate at room temperature for approximately 60 minutes. After four washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 20. Protein M is Used to Confirm that a) the Antibody Binds Via its Fab Domain and b) it Loses its Antigen Binding Ability when Bound by Protein M (FIG. 18)

Briefly, 100 ul of 2 ug/ml biotinylated selectin protein was prepared in PBS and added to a flat bottom 96-well plate (streptavidin coated wells) after 2 washes with PBS+Tween 20 (wash buffer, Pierce).

Protein M+monoclonal antibody (mAb) complexes were allowed to form by mixing protein M with the mAb in assay medium for 120 minutes in a 37° C. incubator. Mixtures containing mAb in assay medium or assay medium alone were included as controls and placed in a 37° C. incubator for 120 minutes. The final concentration of mAb and protein M were 125 ng/ml and 3.4 ug/ml (50× molar excess) or 1.7 ug/ml (25× molar excess), respectively.

After 2 hours, the selectin-coated wells were washed 2×s with wash buffer and the mixtures added in duplicate wells and allowed to incubate for approximately 90 minutes at room temperature. After the incubation period, the wells were washed 3×s. After approximately 60 minutes at room temperature, the wells were washed 3×s and anti-human IgG labeled with HRP (Southern Biotech) was added to the wells and allowed to incubate at room temperature for approximately 60 minutes. After four washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Other Embodiments

The detailed description set-forth above is provided to aid those skilled in the art in practicing the present invention. However, the invention described and claimed herein is not to be limited in scope by the specific embodiments herein disclosed because these embodiments are intended as illustration of several aspects of the invention. Any equivalent embodiments are intended to be within the scope of this invention. Indeed, various modifications of the invention in addition to those shown and described herein will become apparent to those skilled in the art from the foregoing description which do not depart from the spirit or scope of the present inventive discovery. Such modifications are also intended to fall within the scope of the appended claims.

REFERENCES CITED

All publications, patents, patent applications and other references cited in this application are incorporated herein by reference in their entirety for all purposes to the same extent as if each individual publication, patent, patent application or other reference was specifically and individually indicated to be incorporated by reference in its entirety for all purposes. Citation of a reference herein shall not be construed as an admission that such is prior art to the present invention.

Specifically intended to be within the scope of the present invention, and incorporated herein by reference in its entirety, is the following publication:

-   Grover R K, Zhu X, Nieusma T, et al. A structurally distinct human     mycoplasma protein that generically blocks antigen-antibody union.     Science. 2014; 343(6171):656-661. -   Blötz C, Singh N, Dumke R, Stülke J. Characterization of an     Immunoglobulin Binding Protein (IbpM) From Mycoplasma pneumoniae.     Front Microbiol. 2020; 11:685. Lei C, Qian K, Li T, et al.     Neutralization of SARS-CoV-2 spike pseudotyped virus by recombinant     ACE2-Ig. Nat Commun. 2020; 11(1):2070. -   Nie J, Li Q, Wu J, et al. Establishment and validation of a     pseudovirus neutralization assay for SARS-CoV-2. Emerg Microbes     Infect. 2020; 9(1):680-686. -   Mccray P B, Pewe L, et al. Lethal infection of K18-hACE2 mice     infected with severe acute respiratory syndrome coronavirus. J     Virol. 2007; 81(2):813-21. -   G. Brett Moreau, Stacey L. Burgess, Jeffrey M. Sturek, Alexandra N.     Donlan, William A. Petri Jr., Barbara J. Mann Evaluation of     K18-hACE2 mice as a model of SARS-CoV-2 infection. bioRxiv     2020.06.26.171033. -   Subbarao K, Mcauliffe J, Vogel L, et al. Prior infection and passive     transfer of neutralizing antibody prevent replication of severe     acute respiratory syndrome coronavirus in the respiratory tract of     mice. J Virol. 2004; 78(7):3572-7. -   Mebrahtu E, Zheleznyak A, Hur M A, Laforest R, Lapi S E. Initial     characterization of a dually radiolabeled peptide for simultaneous     monitoring of protein targets and enzymatic activity. Nucl Med Biol.     2013; 40(2):190-6.

SEQUENCE LISTING 1) Mycoplasma genitalium The mature protein M sequence (37-556 amino acid) SEQ ID NO: 1 TNLVNQSGYALVASGRSGNLGFKLFSTQSPSAEVK LKSLSLNDGSYQSEIDLSGGANFREKFRNFANELS EAITNSPKGLDRPVPKTEISGLIKTGDNFITPSFK AGYYDHVASDGSLLSYYQSTEYFNNRVLMPILQTT NGTLMANNRGYDDVFRQVPSFSGWSNTKATTVSTS NNLTYDKWTYFAAKGSPLYDSYPNHFFEDVKTLAI DAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNEL QLPESVKKVSLYGDYTGVNVAKQIFANVVELEFYS TSKANSFGFNPLVLGSKTNVIYDLFASKPFTHIDL TQVTLQNSDNSAIDANKLKQAVGDIYNYRRFERQF QGYFAGGYIDKYLVKNVNTNKDSDDDLVYRSLKEL NLHLEEAYREGDNTYYRVNENYYPGASIYENERAS RDSEFQNEILKRAEQNGVTFDENIKRITASGKYSV QFQKLENDTDSSLERMTKAVEGLVTVIGEEKFETV DITGVSSDTNEVKSLAKELKTNALGVKLKL 2) Mycoplasma pneumoniae IgG-blocking mature protein M sequence (36-582 amino acid) SEQ ID NO: 2 AVLIVNEVLRLQSGETLIASGRSGNLSFQLYSKVN QNAKSKLNSISLTDGGYRSEIDLGDGSNFREDFRN FANNLSEAITDAPKDLLRPVPKVEVSGLIKTSSTF ITPNFKAGYYDQVAADGKTLKYYQSTEYFNNRVVM PILQTTNGTLTANNRAYDDIFVDQGVPKFPGWFHD VDKAYYAGSNGQSEYLFKEWNYYVANGSPLYNVYP NFIHFKQIKTIAFDAPRIKQGNTDGINLNLKQRNP DYVIINGLTGDGSTLKDLELPESVKKVSIYGDYHS INVAKQIFKNVLELEFYSTNQDNNFGFNPLVLGDH TNIIYDLFASKPFNYIDLTSLELKDNQDNIDASKL KRAVSDIYIRRRFERQMQGYWAGGYIDRYLVKNTN EKNVNKDNDTVYAALKDINLHLEETYTHGGNTMYR VNENYYPGASAYEAERATRDSEFQKEIVQRAELIG VVFEYGVKNLRPGLKYTVKFESPQEQVALKSTDKF QPVIGSVTDMSKSVTDLIGVLRDNAEILNITNVSK DETVVAELKEKLDRENVFQEIRT 3) Mycoplasma iowae IgG-blocking mature protein M sequence (21-497 amino acid) SEQ ID NO: 3 VGVYVATTNTQNTSVNVNNNENINYKTNGTVVTGD KLTFSAVVQQNSNISTQAFINDGTKPVGTYNKEIN LGKDSITPKYTSGYVETYLESGDTVSRYSSSEYHN NRTLMPILDTKEHYYTSERTYSEIQKGIYRGWEIS TKSINYGEQFAYSASPVLKTVFRDLKQETIKAVQF NLGLSDTSIESINSFLKTNTGIQFVTIKGISQDTD LSKLVLPESVQKLTLLGQRNTINDLKLPSELQEIE IYLGSSLKSIDPLIFPKSANIISDVVMNNTSSVFT EIKLSDSTIDNNSPKLQKAIDDVYTYRIKERAFQG LVPGGYIASWDLTGTKVTSFNNVNIPPLNDGTGRF YIAHVEVKTDGNFGNSQNESIGSKPSNDSQINDWF DWGGGWQKVQEVVVSSSENVSLETATQEIMGFIAK YPNVKKINIVNVKLTDGSTHEQLKDNVIKAITAKY GEESQYKDIEFVLPETVPSPVA 4) Mycoplasma tullyi IgG-blocking mature protein M sequence (30-517 amino acid) SEQ ID NO: 4 IVYTSVKISNTLNQDKQIAGSNLSPTQSNRLIGFQ TLTKFKIQDLDFELQRKIYSSRLNSAELITKSAVV LDQSTLQNHDGEVASGQPAPQVPPPVRIPAKEQTG HTSDFISGYSENNLYYQTPYYYNDRVYMPILDSRK TYLRNERTTTDIGLNNYEGWITSDHSRVNNRVNVF NYRPSPELLAKYTDLAADKLIFTMTIDLYQANPEM INEILKEYSPDFVILSNADSQVMKQLVFPSSVKKL TIKSNLLDRFDFSLANTEIQELELYTPRLTEYNPF ALNPNTHLIFDSNYSKPFTSINLYGVPLTHQQVLS ALEDVFVRRHYERALQGSFSGGYISSLDLSNTGIT SLSNLMIKNINPYYDSYTMSVKYNSNKNGEIELLK TNSWKNPNPAPVSTPAASSPTTPTVPSTPGDSTIN VQDKDLGLLVSSEVKVDPQVLINVVSKYLHNNPRV NVLDISKVSLKSGSLVDVATNLKAKIDYLNVTI 5) Mycoplasma imitans IgG-blocking mature protein M sequence (29-507 amino acid) SEQ ID NO: 5 GIIYTSVKISSSQFNKQISNPIEVPKRNNTLIGFQ TLARFKIENLDFELQKNIYSQNENALVNKAAVVQD NSIINHDGEPTGQNERQVPAPVKILAKEQTGHTSD FISGYTDNNSYYQSPFYYNDRVFMPILDSHSIYLK NERTSKEIGLDSYEGWDKIGYSTINSRVSFVQYRA TDQLIAKFNPSNKQIFAMMINLYQADPAVINNTLR NYLPDFVILSNADNQIIKRLVFPSSVKKLTIKSNL LDRFDFSLANSNIQELELYTPNLTEYNPLALNPDT HLIFDTAYSKPFTSINLYGAKLTTQETQEAFNDIF VRRYYERYLQGAFVGGYISLLDLSNTGINSVNDYV VKNINPAYSSYTLSVTYNPGDPGQISILRTTTSIP SETQPTNPSNNTPSQPTDPNITTQIDAKEKDLKLV VSSTIQVDTQVVINVVGKYLLNNPRVNNVDISRIQ LKSGTLVDIANNFKTKMSYLNVSV 6) Mycoplasma gallisepticum IgG-blocking mature protein M sequence (29-417 amino acid) SEQ ID NO: 6 GIIYTSVKISNSLYQDKLISGQNQPLAPVNRLIGF QTLAKFRIEGLDFELKKKIYSSTVESVELVNRSAV LVDDSVLENHDGELTSVQSDPQVPAPVKILAKEQT GHTSDFVSGYSDDNKYYQSPYYYNDRVYMPILDSP TIYLKNERTSSDIGLNNYQGWIAVGHARVNSRVSV FNYRATDELLAKFNNLPDRLIFTMSIDLYQANPAM INETLKEYSPDFVILSNADSQTMKQLVFPSSVKKL TIKSNILDRFDFSLVNSEIQELELYTPNLTEYNPL ALNPKTHLIFDADYSTRFLSINLYGAQLTNQQALA ALEDVFVHRYYERALQGSFVDGYISSLVLSDTGIT SLNNLVIKNINPNYDSYIMSVKYHSNDSGQIELLK TTAW 7) Mycoplasma alvi IgG-blocking protein M sequence, signal peptide included (28-540 amino acid) SEQ ID NO: 7 ISIPFIIQSTHTNNANSTIPNVSKPSGSSLAPINY SYDNFVNNYDGTLTSNSLVFSASGSKEVKSSLQTR AITVDGLNDIDSSMGLVDAMSQGLLDNSYDPKYNE VREVIDMDGAHRKIVTTKCFDNNRKYMPILTYNND TYYSYSESRTWDDVNRSIYPGWNLNRSNLSSHNQN KMIGVDILVYTPTEVLKTAYPSVTDKIIGLSISLS NLISTYGDQTKQVLSQLIDAVNPSLVNFWGVSDSN LDKLPDLSSNTNIKKISIRGDYSNLNGFVFPSSVL ELEFSSQNYKAVDPLQIPESAAIIYEQGYSSYFTS IDLSTHKGMSNEDLQKAVNVVYQQRIHERAFQGDF AGGYIYSWNLRNTGIYSFNNVTIPMLTDGTGRFYI AYVAVETDGNQGPIANEVISDNSSKPSNDSQINEW FDWNQNGWSTITEVKITAKDNVKLNFNNTVQEILG FINKYPNIKVVDISALQFSNDETLDELIDAVNKAI ADKYTGMDGTPTVKLDFIKVNYL 8) Mycoplasma penetrans IgG-blocking mature protein M sequence (31-505 amino acid) SEQ ID NO: 8 LVTSNNNHENSLNNSSSNNGSNLKVNGSVISTDNL NIVATGLSSNVSSQVSRQSLSSSSSSESTVDSKYT AKKKLTTVSGQEKEYLVSTVYENNRKFMPILAYDE DISYNNYQQSREYKDVVYGNFPGWDKKVAVVHQID NVDLSKAYASVAEFTPTEILKKHFQVLQTSVKQLY VALDSKTMTADVITKLVDRYQPDYLRIESVDDTSI KQLPDMKYFSTVKKVDLGGAFTTIKGVSFPTTTQE LKISSDNIKSIDPLQIPESAAIITETVHDARFTEI DLSSHTDLTTDQLQKAVNIVYKDRIKERAFQGNFA GGYIYSWNLQNTGITSFNDVSIPKLNDGTDRFYIA YVAVSSGNSNGTANETITGGKEPSNDSQIGEWWDS SSDGWSKVSKVTVTAKNGASLDYNKTLTEIMGFLA KYPNVKTIDISLLKFEDASKTLDGLKTELTNQIKS KYGEDSSYAKIDFIITSQSN 9) Artificial armY-ACE2 fusion protein sequence (1,298 amino acids). Including the human IL-2 signal sequence, human myc-peptide epitope tag, linker, human ACE2, linker, Mycoplasma genitalium protein M SEQ ID NO: 9 MYRMQLLSCIALSLALVTNSEQKLISEEDLLRKRG SPGGAQSTIEEQAKTFLDKFNHEAEDLFYQSSLAS WNYNTNITEENVQNMNNAGDKWSAFLKEQSTLAQM YPLQEIQNLTVKLQLQALQQNGSSVLSEDKSKRLN TILNTMSTIYSTGKVCNPDNPQECLLLEPGLNEIM ANSLDYNERLWAWESWRSEVGKQLRPLYEEYVVLK NEMARANHYEDYGDYWRGDYEVNGVDGYDYSRGQL IEDVEHTFEEIKPLYEHLHAYVRAKLMNAYPSYIS PIGCLPAHLLGDMWGRFWTNLYSLTVPFGQKPNID VTDAMVDQAWDAQRIFKEAEKFFVSVGLPNMTQGF WENSMLTDPGNVQKAVCHPTAWDLGKGDFRILMCT KVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGAN EGFHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDN ETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGE IPKDQWMKKWWEMKREIVGVVEPVPHDETYCDPAS LFHVSNDYSFIRYYTRTLYQFQFQEALCQAAKHEG PLHKCDISNSTEAGQKLFNMLRLGKSEPWTLALEN VVGAKNMNVRPLLNYFEPLFTWLKDQNKNSFVGWS TDWSPYADQSIKVRISLKSALGDKAYEWNDNEMYL FRSSVAYAMRQYFLKVKNQMILFGEEDVRVANLKP RISFNFFVTAPKNVSDIIPRTEVEKAIRMSRSRIN DAFRLNDNSLEFLGIQPTLGPPNQPPVSGGGGSGG GGSGGGGSTNLVNQSGYALVASGRSGNLGFKLFST QSPSAEVKLKSLSLNDGSYQSEIDLSGGANFREKF RNFANELSEAITNSPKGLDRPVPKTEISGLIKTGD NFITPSFKAGYYDHVASDGSLLSYYQSTEYFNNRV LMPILQTTNGTLMANNRGYDDVFRQVPSFSGWSNT KATTVSTSNNLTYDKWTYFAAKGSPLYDSYPNHFF EDVKTLAIDAKDISALKTTIDSEKPTYLIIRGLSG NGSQLNELQLPESVKKVSLYGDYTGVNVAKQIFAN VVELEFYSTSKANSFGFNPLVLGSKTNVIYDLFAS KPFTHIDLTQVTLQNSDNSAIDANKLKQAVGDIYN YRRFERQFQGYFAGGYIDKYLVKNVNTNKDSDDDL VYRSLKELNLHLEEAYREGDNTYYRVNENYYPGAS IYENERASRDSEFQNEILKRAEQNGVTFDENIKRI TASGKYSVQFQKLENDTDSSLERMTKAVEGLVTVI GEEKFETVDITGVSSDTNEVKSLAKELKTNALGVK LKL 10) Artificial Protein M with peptide tags (aka: armY) protein sequence (587 amino acids). Including the human IL-2 signal sequence. Avi-Tag, human myc- peptide epitope tag, linker. Mycoplasma genitalium protein M SEQ ID NO: 10 MYRMQLLSCIALSLALVTNSMAGGLNDIFEAQKIE WHEGGEQKLISEEDLLRKRAANGGGGSGGGGSTNL VNQSGYALVASGRSGNLGFKLFSTQSPSAEVKLKS LSLNDGSYQSEIDLSGGANFREKFRNFANELSEAI TNSPKGLDRPVPKTEISGLIKTGDNFITPSFKAGY YDHVASDGSLLSYYQSTEYFNNRVLMPILQTTNGT LMANNRGYDDVFRQVPSFSGWSNTKATTVSTSNNL TYDKWTYFAAKGSPLYDSYPNHFFEDVKTLAIDAK DISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLP ESVKKVSLYGDYTGVNVAKQIFANVVELEFYSTSK ANSFGFNPLVLGSKTNVIYDLFASKPFTHIDLTQV TLQNSDNSAIDANKLKQAVGDIYNYRRFERQFQGY FAGGYIDKYLVKNVNTNKDSDDDLVYRSLKELNLH LEEAYREGDNTYYRVNENYYPGASIYENERASRDS EFQNEILKRAEQNGVTFDENIKRITASGKYSVQFQ KLENDTDSSLERMTKAVEGLVTVIGEEKFETVDIT GVSSDTNEVKSLAKELKTNALGVKLKL 11) Artificial Protein M horseradish peroxidase (HRP) fusion protein sequence (876 amino acids). Including the human IL-2 signal sequence, human myc-peptide epitope, linker. HRP, linker, Mycoplasma genitalium protein M SEQ ID NO: 11 MYRMQLLSCIALSLALVTNSEQKLISEEDLAANQL TPTFYDNSCPNVSNIVRDTIVNELRSDPRIAASIL RLHFHDCFVNGCDASILLDNTTSFRTEKDAFGNAN SARGFPVIDRMKAAVESACPRTVSCADLLTIAAQQ SVTLAGGPSWRVPLGRRDSLQAFLDLANANLPAPF FTLPQLKDSFRNVGLNRSSDLVALSGGHTFGKNQC RFIMDRLYNFSNTGLPDPTLNTTYLQTLRGLCPLN GNLSALVDFDLRTPTIFDNKYYVNLEEQKGLIQSD QELFSSPNATDTIPLVRSFANSTQTFFNAFVEAMD RMGNITPLTGTQGQIRLNCRVVNSNSGGGGSGGGG SGGGGSTNLVNQSGYALVASGRSGNLGFKLFSTQS PSAEVKLKSLSLNDGSYQSEIDLSGGANFREKFRN FANELSEAITNSPKGLDRPVPKTEISGLIKTGDNF ITPSFKAGYYDHVASDGSLLSYYQSTEYFNNRVLM PILQTTNGTLMANNRGYDDVFRQVPSFSGWSNTKA TTVSTSNNLTYDKWTYFAAKGSPLYDSYPNHFFED VKTLAIDAKDISALKTTIDSEKPTYLIIRGLSGNG SQLNELQLPESVKKVSLYGDYTGVNVAKQIFANVV ELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKP FTHIDLTQVTLQNSDNSAIDANKLKQAVGDIYNYR RFERQFQGYFAGGYIDKYLVKNVNTNKDSDDDLVY RSLKELNLHLEEAYREGDNTYYRVNENYYPGASIY ENERASRDSEFQNEILKRAEQNGVTFDENIKRITA SGKYSVQFQKLENDTDSSLERMTKAVEGLVTVIGE EKFETVDITGVSSDTNEVKSLAKELKTNALGVKLK L 12) Artificial Set of three Glycine (G₄)-Serine (S₁) linker sequence (1-15 amino acid) SEQ ID NO: 12 GGGGSGGGGSGGGGS 13) Artificial Set of two Glycine (G₄)-Serine (S₁) linker sequence (1-10 amino acid) SEQ ID NO: 13 GGGGSGGGGS 14) SEQ ID NO: 14 Artificial Set of one Glycine (G₄)-Serine (S₁) linker sequence (1-5 amino acid) GGGGS 15) SEQ ID NO:  15

Human

Angiotensin-Converting Enzyme 2 (ACE2) Extracellular Domain Protein Sequence (18-740 Amino Acid)

Essential counter-regulatory carboxypeptidase of the renin-angiotensin hormone system that is a critical regulator of blood volume, systemic vascular resistance, and thus cardiovascular homeostasis. This receptor acts as an attachment receptor for human coronaviruses SARS-CoV and SARS-CoV-2, as well as human coronavirus NL63/HCoV-NL63

16) SEQ ID NO: 16 QSTIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNT NITEENVQNMNNAGDKWSAFLKEQSTLAQMYPLQE IQNLTVKLQLQALQQNGSSVLSEDKSKRLNTILNT MSTIYSTGKVCNPDNPQECLLLEPGLNEIMANSLD YNERLWAWESWRSEVGKQLRPLYEEYVVLKNEMAR ANHYEDYGDYWRGDYEVNGVDGYDYSRGQLIEDVE HTFEEIKPLYEHLHAYVRAKLMNAYPSYISPIGCL PAHLLGDMWGRFWTNLYSLTVPFGQKPNIDVTDAM VDQAWDAQRIFKEAEKFFVSVGLPNMTQGFWENSM LTDPGNVQKAVCHPTAWDLGKGDFRILMCTKVTMD DFLTAHHEMGHIQYDMAYAAQPFLLRNGANEGFHE AVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEIN FLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKDQ WMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVS NDYSFIRYYTRTLYQFQFQEALCQAAKHEGPLHKC DISNSTEAGQKLFNMLRLGKSEPWTLALENVVGAK NMNVRPLLNYFEPLFTWLKDQNKNSFVGWSTDWSP YADQSIKVRISLKSALGDKAYEWNDNEMYLFRSSV AYAMRQYFLKVKNQMILFGEEDVRVANLKPRISFN FFVTAPKNVSDIIPRTEVEKAIRMSRSRINDAFRL NDNSLEFLGIQPTLGPPNQPPVS

Human

CD209 (DC-SIGN) Extracellular Domain Protein Sequence (59-404 Amino Acid).

A pathogen-recognition receptor expressed on the surface of immature dendritic cells (DCs) and involved in initiation of primary immune response. This receptor acts as an attachment receptor for HIV-1, HIV-2, Ebolavirus, Cytomegalovirus, HCV, Dengue virus, Measles virus, Herpes simplex virus 1, Influenza virus, SARS-CoV, Japanese encephalitis virus, Lassa virus, Respiratory syncytial virus, Rift valley fever virus, West-nile virus, Marburg virus, Uukuniemi virus, and Yersinia Pestis

17) SEQ ID NO: 17 QVSKVPSSISQEQSRQDAIYQNLTQLKAAVGELSE KSKLQEIYQELTQLKAAVGELPEKSKLQEIYQELT RLKAAVGELPEKSKLQEIYQELTWLKAAVGELPEK SKMQEIYQELTRLKAAVGELPEKSKQQEIYQELTR LKAAVGELPEKSKQQEIYQELTRLKAAVGELPEKS KQQEIYQELTQLKAAVERLCHPCPWEWTFFQGNCY FMSNSQRNWHDSITACKEVGAQLVVIKSAEEQNFL QLQSSRSNRFTWMGLSDLNQEGTWQWVDGSPLLPS FKQYWNRGEPNNVGEEDCAEFSGNGWNDDKCNLAK FWICKKSAASCSRDEEQFLSPAPATPNPPPA

Human

C-Type Lectin Domain Family 4 Member M Extracellular Domain Protein Sequence (71-399 Amino Acid).

Probable pathogen-recognition receptor involved in peripheral immune surveillance in liver. This receptor acts as an attachment receptor for Ebolavirus, Hepatitis C virus, HIV-1, Human coronavirus 229E, Human cytomegalovirus/HHV-5, Influenza virus, SARS-CoV, West-nile virus, Japanese encephalitis virus, Marburg virus glycoprotein, and M. bovis.

18) SEQ ID NO: 18 QVSKVPSSLSQEQSEQDAIYQNLTQLKAAVGELSE KSKLQEIYQELTQLKAAVGELPEKSKLQEIYQELT RLKAAVGELPEKSKLQEIYQELTRLKAAVGELPEK SKLQEIYQELTRLKAAVGELPEKSKLQEIYQELTE LKAAVGELPEKSKLQEIYQELTQLKAAVGELPDQS KQQQIYQELTDLKTAFERLCRHCPKDWTFFQGNCY FMSNSQRNWHDSVTACQEVRAQLVVIKTAEEQNFL QLQTSRSNRFSWMGLSDLNQEGTWQWVDGSPLSPS FQRYWNSGEPNNSGNEDCAEFSGSGWNDNRCDVDN YWICKKPAACFRDE

Human

CD4 Extracellular Domain Protein Sequence (26-396 Amino Acid).

Integral membrane glycoprotein that plays an essential role in the immune response and serves multiple functions in responses against both external and internal offenses. In T-cells, functions primarily as a coreceptor for MHC class II molecule:peptide complex. This coreceptor acts as an attachment receptor for HIV.

19) SEQ ID NO: 19 KKVVLGKKGDTVELTCTASQKKSIQFHWKNSNQIK ILGNQGSFLTKGPSKLNDRADSRRSLWDQGNFPLI IKNLKIEDSDTYICEVEDQKEEVQLLVFGLTANSD THLLQGQSLTLTLESPPGSSPSVQCRSPRGKNIQG GKTLSVSQLELQDSGTWTCTVLQNQKKVEFKIDIV VLAFQKASSIVYKKEGEQVEFSFPLAFTVEKLTGS GELWWQAERASSSKSWITFDLKNKEVSVKRVTQDP KLQMGKKLPLHLTLPQALPQYAGSGNLTLALEAKT GKLHQEVNLVVMRATQLQKNLTCEVWGPTSPKLML SLKLENKEAKVSKREKAVWVLNPEAGMWQCLLSDS GQVLLESNIKVLPTWSTPVQP

Human

Synaptic Vesicle Glycoprotein 2A Extracellular Domain Protein Sequence (469-598 Amino Acid).

Plays a role in the control of regulated secretion in neural and endocrine cells, enhancing selectively low-frequency neurotransmission. This protein acts as an attachment receptor for the C. botulinum neurotoxin type A2 (BoNT/A, botA).

20) SEQ ID NO: 20 PDMIRHLQAVDYASRTKVFPGERVEHVTFNFTLEN QIHRGGQYFNDKFIGLRLKSVSFEDSLFEECYFED VTSSNTFFRNCTFINTVFYNTDLFEYKFVNSRLIN STFLHNKEGCPLDVTGTGEGAYMVY

Human

Synaptic Vesicle Glycoprotein 2B Extracellular Domain Protein Sequence (412-535 Amino Acid).

Probably plays a role in the control of regulated secretion in neural and endocrine cells. This protein acts as an attachment receptor for the C. botulinum neurotoxin type A2 (BoNT/A, botA). Probably also serves as a receptor for the closely related C. botulinum neurotoxin type A1.

21) SEQ ID NO: 21 PDMIRYFQDEEYKSKMKVFFGEHVYGATINFTMEN QIHQHGKLVNDKFTRMYFKHVLFEDTFFDECYFED VTSTDTYFKNCTIESTIFYNTDLYEHKFINCRFIN STFLEQKEGCHMDLEQDND

Human

Synaptic Vesicle Glycoprotein 2C Extracellular Domain Protein Sequence (459-578 Amino Acid).

Plays a role in the control of regulated secretion in neural and endocrine cells, enhancing selectively low-frequency neurotransmission. This protein acts as an attachment receptor for C. botulinum neurotoxin type A (BoNT/A, botA). Also serves as a receptor for the closely related C. botulinum neurotoxin type A2.

22) SEQ ID NO: 22 KPLQSDEYALLTRNVERDKYANFTINFTMENQIHT GMEYDNGRFIGVKFKSVTFKDSVFKSCTFEDVTSV NTYFKNCTFIDTVFDNTDFEPYKFIDSEFKNCSFF HNKTGCQITFDDDYS

Human

Synaptotagmin I Extracellular Domain Protein Sequence (1-57 Amino Acid).

Calcium sensor that participates in triggering neurotransmitter release at the synapse. This protein acts as an attachment receptor for C. botulinum neurotoxin type B (BoNT/B, botB)

23) SEQ ID NO: 23 MVSESHHEALAAPPVTTVATVLPSNATEPASPGEG KEDAFSKLKEKFMNELHKIPLP

Human

Synaptotagmin II Extracellular Domain Protein Sequence (1-62 Amino Acid).

Exhibits calcium-dependent phospholipid and inositol polyphosphate binding properties. This protein acts as an attachment receptor for C. botulinum neurotoxin type B (BoNT/B, botB)

24) SEQ ID NO: 24 MRNIFKRNQEPIVAPATTTATMPIGPVDNSTESGGAGESQEDMFAKLKE KLFNEINKIPLPP Human

HLA Class II Histocompatibility Antigen, DRB1 Beta Chain Extracellular Domain Protein Sequence (30-227 Amino Acid).

A beta chain of antigen-presenting major histocompatibility complex class II (MHCII) molecule. This protein acts as an attachment receptor for Epstein-Barr virus and Staphylococcal enterotoxin A and B.

25) SEQ ID NO: 25 GDTRPRFLWQPKRECHFFNGTERVRFLDRYFYNQEESVRFDSDVGEFRA VTELGRPDAEYWNSQKDILEQARAAVDTYCRHNYGVVESFTVQRRVQPK VTVYPSKTQPLQHHNLLVCSVSGFYPGSIEVRWFLNGQEEKAGMVSTGL IQNGDWTFQTLVMLETVPRSGEVYTCQVEHPSVTSPLTVEWRARSESAQ SK Human

Human

HLA Class II Histocompatibility Antigen, DR Alpha Chain Extracellular Domain Protein Sequence (26-216 Amino Acid).

Binds peptides derived from antigens that access the endocytic route of antigen presenting cells (APC) and presents them on the cell surface for recognition by the CD4 T-cells. This protein acts as an attachment receptor for Epstein-Barr virus BZLF2/gp42, Staphylococcus aureus enterotoxin A/entA, enterotoxin B/entB, enterotoxin C1/entC1, enterotoxin D/entD, and enterotoxin H/entH.

26) SEQ ID NO: 26 IKEEHVIIQAEFYLNPDQSGEFMFDFDGDEIFHVDMAKKETVWRLEEFG RFASFEAQGALANIAVDKANLEIMTKRSNYTPITNVPPEVTVLTNSPVE LREPNVLICFIDKFTPPVVNVTWLRNGKPVTTGVSETVFLPREDHLFRK FHYLPFLPSTEDVYDCRVEHWGLDEPLLKHWEFDAPSPLPETTE

Human

T Cell Receptor Beta Variable 7-9 Mature Protein Sequence (22-115 Amino Acid).

V region of the variable domain of T cell receptor (TR) beta chain that participates in the antigen recognition. This protein acts as an attachment receptor for Staphylococcus aureus enterotoxin A/entA.

27) SEQ ID NO: 27  GVSQNPRHKITKRGQNVTFRCDPISEHNRLYWYRQTLGQGPEFLTYFQN EAQLEKSRLLSDRFSAERPKGSFSTLEIQRTEQGDSAMYLCASSL

Human

T Cell Receptor Beta Variable 19 Mature Protein Sequence (22-114 Amino Acid).

V region of the variable domain of T cell receptor (TR) beta chain that participates in the antigen recognition. This protein acts as an attachment receptor for Staphylococcus aureus enterotoxin B/entB.

28) SEQ ID NO: 28 GITQSPKYLFRKEGQNVTLSCEQNLNHDAMYWYRQDPGQGLRLIYYSQI VNDFQKGDIAEGYSVSREKKESFPLTVTSAQKNPTAFYLCASSI

Human

Hepatitis a Virus Cellular Receptor 1 Extracellular Domain Protein Sequence (21-364 Amino Acid).

May play a role in T-helper cell development and the regulation of asthma and allergic diseases. This protein acts as an attachment receptor for Hepatitis A virus, Ebola virus, Marburg virus and Dengue virus and Clostridium perfringens Epsilon toxin (ETX).

29) SEQ ID NO: 29 SVKVGGEAGPSVTLPCHYSGAVTSMCWNRGSCSLFTCQNGIVWTNGTHV TYRKDTRYKLLGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGWFNDMKI TVSLEIVPPKVTTTPIVTTVPTVTTVRTSTTVPTTTTVPMTTVPTTTVP TTMSIPTTTTVLTTMTVSTTTSVPTTTSIPTTTSVPVTTTVSTFVPPMP LPRQNHEPVATSPSSPQPAETHPTTLQGAIRREPTSSPLYSYTTDGNDT VTESSDGLWNNNQTQLFLEHSLLTANTTKGIYAGVCISVLVLLALLGVI IAKKYFFKKEVQQLSVSFSSLQIKALQNAVEKEVQAEDNIYIENSLYAT D

Human

Myelin and Lymphocyte Protein Protein Sequence (1-153 Amino Acid).

Could be an important component in vesicular trafficking cycling between the Golgi complex and the apical plasma membrane. This protein acts as an attachment receptor for Clostridium perfringens Epsilon toxin (ETX).

30) SEQ ID NO: 30 MAPAAATGGSTLPSGFSVFTTLPDLLFIFEFIFGGLVWILVASSLVPWP LVQGWVMFVSVFCFVATTTLIILYIIGAHGGETSWVTLDAAYHCTAALF YLSASVLEALATITMQDGFTYRHYHENIAAVVFSYIATLLYVVHAVFSL IRWKSS

Human

Complement Factor H Mature Protein Sequence (19-1231 Amino Acid).

Glycoprotein that plays an essential role in maintaining a well-balanced immune response by modulating complement activation. This protein binds to Streptococcus pneumoniae, Neisseria meningitides, Staphylococcus aureus, Borrelia burgdorferi and West nile virus.

31) SEQ ID NO: 31 EDCNELPPRRNTEILTGSWSDQTYPEGTQAIYKCRPGYRSLGNVIMVCR KGEWVALNPLRKCQKRPCGHPGDTPFGTFTLTGGNVFEYGVKAVYTCNE GYQLLGEINYRECDTDGWTNDIPICEVVKCLPVTAPENGKIVSSAMEPD REYHFGQAVRFVCNSGYKIEGDEEMHCSDDGFWSKEKPKCVEISCKSPD VINGSPISQKIIYKENERFQYKCNMGYEYSERGDAVCTESGWRPLPSCE EKSCDNPYIPNGDYSPLRIKHRTGDEITYQCRNGFYPATRGNTAKCTST GWIPAPRCTLKPCDYPDIKHGGLYHENMRRPYFPVAVGKYYSYYCDEHF ETPSGSYWDHIHCTQDGWSPAVPCLRKCYFPYLENGYNQNYGRKFVQGK SIDVACHPGYALPKAQTTVTCMENGWSPTPRCIRVKTCSKSSIDIENGF ISESQYTYALKEKAKYQCKLGYVTADGETSGSITCGKDGWSAQPTCIKS CDIPVFMNARTKNDFTWFKLNDTLDYECHDGYESNTGSTTGSIVCGYNG WSDLPICYERECELPKIDVHLVPDRKKDQYKVGEVLKFSCKPGFTIVGP NSVQCYHFGLSPDLPICKEQVQSCGPPPELLNGNVKEKTKEEYGHSEVV EYYCNPRFLMKGPNKIQCVDGEWTTLPVCIVEESTCGDIPELEHGWAQL SSPPYYYGDSVEFNCSESFTMIGHRSITCIHGVWTQLPQCVAIDKLKKC KSSNLIILEEHLKNKKEFDHNSNIRYRCRGKEGWIHTVCINGRWDPEVN CSMAQIQLCPPPPQIPNSHNMTTTLNYRDGEKVSVLCQENYLIQEGEEI TCKDGRWQSIPLCVEKIPCSQPPQIEHGTINSSRSSQESYAHGTKLSYT CEGGFRISEENETTCYMGKWSSPPQCEGLPCKSPPEISHGVVAHMSDSY QYGEEVTYKCFEGFGIDGPAIAKCLGEKWSHPPSCIKTDCLSLPSFENA IPMGEKKDVYKAGEQVTYTCATYYKMDGASNVTCINSRWTGRPTCRDTS CVNPPTVQNAYIVSRQMSKYPSGERVRYQCRSPYEMFGDEEVMCLNGNW TEPPQCKDSTGKCGPPPPIDNGDITSFPLSVYAPASSVEYQCQNLYQLE GNKRITCRNGQWSEPPKCLHPCVISREIMENYNIALRWTAKQKLYSRTG ESVEFVCKRGYRLSSRSHTLRTTCWDGKLEYPTCAKR

Human

Hepatocyte Growth Factor Receptor Extracellular Domain Protein Sequence (25-932 Amino Acid).

Receptor tyrosine kinase that transduces signals from the extracellular matrix into the cytoplasm by binding to hepatocyte growth factor/HGF ligand. This receptor acts as an attachment receptor for Listeria monocytogenes internalin InlB, mediating entry of the pathogen into cells.

32) SEQ ID NO: 32 ECKEALAKSEMNVNMKYQLPNFTAETPIQNVILHEHHIFLGATNYIYVL NEEDLQKVAEYKTGPVLEHPDCFPCQDCSSKANLSGGVWKDNINMALVV DTYYDDQLISCGSVNRGTCQRHVFPHNHTADIQSEVHCIFSPQIEEPSQ CPDCVVSALGAKVLSSVKDRFINFFVGNTINSSYFPDHPLHSISVRRLK ETKDGFMFLTDQSYIDVLPEFRDSYPIKYVHAFESNNFIYFLTVQRETL DAQTFHTRIIRFCSINSGLHSYMEMPLECILTEKRKKRSTKKEVFNILQ AAYVSKPGAQLARQIGASLNDDILFGVFAQSKPDSAEPMDRSAMCAFPI KYVNDFFNKIVNKNNVRCLQHFYGPNHEHCFNRTLLRNSSGCEARRDEY RTEFTTALQRVDLFMGQFSEVLLTSISTFIKGDLTIANLGTSEGRFMQV VVSRSGPSTPHVNFLLDSHPVSPEVIVEHTLNQNGYTLVITGKKITKIP LNGLGCRHFQSCSQCLSAPPFVQCGWCHDKCVRSEECLSGTWTQQICLP AIYKVFPNSAPLEGGTRLTICGWDFGFRRNNKFDLKKTRVLLGNESCTL TLSESTMNTLKCTVGPAMNKHFNMSIIISNGHGTTQYSTFSYVDPVITS ISPKYGPMAGGTLLTLTGNYLNSGNSRHISIGGKTCTLKSVSNSILECY TPAQTISTEFAVKLKIDLANRETSIFSYREDPIVYEIHPTKSFISGGST ITGVGKNLNSVSVPRMVINVHEAGRNFTVACQHRSNSEIICCTTPSLQQ LNLQLPLKTKAFFMLDGILSKYFDLIYVHNPVFKPFEKPVMISMGNENV LEIKGNDIDPEAVKGEVLKVGNKSCENIHLHSEAVLCTVPNDLLKLNSE LNIEWKQAISSTVLGKVIVQPDQNFT

Human

Membrane Cofactor Protein (CD46) Extracellular Domain Protein Sequence (35-343 Amino Acid).

Acts as a cofactor for complement factor I, a serine protease which protects autologous cells against complement-mediated injury by cleaving C3b and C4b deposited on host tissue. This protein acts as an attachment receptor for Adenovirus subgroup B2 and Ad3, Measles virus, Herpesvirus 6/HHV-6, Neisseria and Streptococcus pyogenes.

33) SEQ ID NO: 33 CEEPPTFEAMELIGKPKPYYEIGERVDYKCKKGYFYIPPLATHTICDRN HTWLPVSDDACYRETCPYIRDPLNGQAVPANGTYEFGYQMHFICNEGYY LIGEEILYCELKGSVAIWSGKPPICEKVLCTPPPKIKNGKHTFSEVEVF EYLDAVTYSCDPAPGPDPFSLIGESTIYCGDNSVWSRAAPECKVVKCRF PVVENGKQISGFGKKFYYKATVMFECDKGFYLDGSDTIVCDSNSTWDPP VPKCLKVLPPSSTKPPALSHSVSTSSTTKSPASSASGPRPTYKPPVSNY PGYPKPEEGILDSLD

Human

Glycophorin-A Extracellular Domain Protein Sequence (20-91 Amino Acid).

Glycophorin A is the major intrinsic membrane protein of the erythrocyte. This protein acts as an attachment receptor for Plasmodium falciparum, Influenza virus, Hepatitis A virus (HAV), Streptococcus gordonii.

34) SEQ ID NO: 34 SSTTGVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAHEVSEISVR TVYPPEEETGERVQLAHHFSEPE

Human

C-Type Lectin Domain Family 4 Member K (Langerin, CD207) Extracellular Domain Protein Sequence (65-328 Amino Acid).

Calcium-dependent lectin displaying mannose-binding specificity. This protein binds to Candida species, Saccharomyces species, Malassezia furfur, human immunodeficiency virus-1 (HIV-1) and Yesinia pestis.

35) SEQ ID NO: 35 PRFMGTISDVKTNVQLLKGRVDNISTLDSEIKKNSDGMEAAGVQIQMVN ESLGYVRSQFLKLKTSVEKANAQIQILTRSWEEVSTLNAQIPELKSDLE KASALNTKIRALQGSLENMSKLLKRQNDILQVVSQGWKYFKGNFYYFSL IPKTWYSAEQFCVSRNSHLTSVTSESEQEFLYKTAGGLWWIGLTKAGME GDWSWVDDTPFNKVQSVRFWIPGEPNNAGNNEHCGNIKAPSLQAWNDAP CDKTFLFICKRPYVPSEP

Human

Anthrax Toxin Receptor 1 Mature Protein Sequence (33-564 Amino Acid).

Plays a role in cell attachment and migration. Interacts with extracellular matrix proteins and with the actin cytoskeleton. This protein acts as an attachment receptor for Anthrax toxin.

36) SEQ ID NO: 36 EDGGPACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMS FIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQ IYYENRQGYRTASVIIALTDGELHEDLFFYSEREANRSRDLGAIVYCVG VKDFNETQLARIADSKDHVFPVNDGFQALQGIIHSILKKSCIEILAAEP STICAGESFQVVVRGNGFRHARNVDRVLCSFKINDSVTLNEKPFSVEDT YLLCPAPILKEVGMKAALQVSMNDGLSFISSSVIITTTHCSDGSILAIA LLILFLLLALALLWWFWPLCCTVIIKEVPPPPAEESEEEDDDGLPKKKW PTVDASYYGGRGVGGIKRMEVRWGEKGSTEEGAKLEKAKNARVKMPEQE YEFPEPRNLNNNMRRPSSPRKWYSPIKGKLDALWVLLRKGYDRVSVMRP QPGDTGRCINFTRVKNNQPAKYPLNNAYHTSSPPPAPIYTPPPPAPHCP PPPPSAPTPPIPSPPSTLPPPPQAPPPNRAPPPSRPPPRPSV

Human

Anthrax Toxin Receptor 2 Extracellular Domain Protein Sequence (34-318 Amino Acid).

Necessary for cellular interactions with laminin and the extracellular matrix. This protein acts as an attachment receptor for Anthrax toxin.

37) SEQ ID NO: 37 QEQPSCRRAFDLYFVLDKSGSVANNWIEIYNFVQQLAERFVSPEMRLSF IVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQI QKAGGLKTSSIIIALTDGKLDGLVPSYAEKEAKISRSLGASVYCVGVLD FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEILELQPSSV CVGEEFQIVLSGRGFMLGSRNGSVLCTYTVNETYTTSVKPVSVQLNSML CPAPILNKAGETLDVSVSFNGGKSVISGSLIVTATECSNG

Artificial armY-Angiotensin-converting enzyme 2 (ACE2) fusion protein with N-terminal Myc-tag codon-optimized (for human) nucleotide sequence (3,897 bp) 38) SEQ ID NO: 38 atgtacaggatgcaactcctgtcttgcattgcactaagtcttgcacttgtcacaaacagtgagcaaaagcttatctctgaagaggacttact aagaaagcggggcagcccaggcggagcgcagagcacaatcgaggaacaggccaagaccttcctggacaagttcaaccacgaagctgaagac ctgttctaccaatctagcctggctagttggaactacaacaccaacattacagaagagaacgtgcagaacatgaacaacgcaggcgacaagtggtcc gccttccttaaagagcagtctacactggcccagatgtaccctctgcaagagattcagaatctgaccgtgaagctgcagctgcaggctctccagcaga atgggtccagcgtgctgtctgaggataagagcaagcggctgaacaccatcctgaatacaatgagcaccatctacagcaccggcaaagtgtgtaac cctgacaacccccaggagtgtctgctgctggaacctggcctgaacgaaatcatggccaactccctggactacaacgagagactgtgggcctggga gagctggcgtagcgaggtgggaaaacagctgcgccccctgtatgaggagtacgtggtgctgaagaatgagatggccagagccaaccactacga ggactacggcgactattggagaggcgattatgaagtcaacggcgttgacggctacgactacagccggggacagctgatcgaagacgtggaacat acgtagaggagatcaagcctctgtacgagcacctgcacgcctacgtaagagccaaactgatgaatgcctaccccagctacatctcccctatcggct gcctgcccgcccatctgctcggcgacatgtggggcagattctggaccaacctgtattctctgacagtgcctttcggccagaaacctaacatcgacgt gacagatgccatggtggaccaggcctgggatgcccaaagaatcttcaaggaagccgagaaattcttcgtgtccgtggggctgcctaatatgaccca gggcttctgggaaaacagcatgctcaccgatcctggcaacgtgcagaaggcagtgtgccaccccaccgcctgggaccttggaaagggcgacttc cggattctgatgtgcaccaaggtgaccatggacgacttcctgaccgctcaccacgagatgggccacatccagtacgacatggcctacgccgctca gcctttcctcctgagaaacggcgctaatgaaggcttccacgaggccgtgggcgaaatcatgagcctgagcgccgccacccctaagcacctgaagt ctatcggactgctgagccccgactttcaggaggacaacgaaactgagatcaacttcttgctgaaacaggccctgacaatcgttggcaccctgccctt tacctacatgctggaaaagtggagatggatggtctttaagggcgaaatccccaaggaccaatggatgaagaagtggtgggagatgaagcgggaa atcgtgggcgtggtggaacctgtgccccacgacgagacatactgcgatcctgctagcctctttcacgtgagcaatgattactcattcatccggtacta caccagaactctgtaccagttccagttccaggaggccctgtgccaggccgccaagcacgagggccctctgcacaagtgcgacatctctaacagca ccgaggccggccagaagctgttcaacatgctgagactgggcaagagcgaaccttggacactggccctggagaacgtggtcggagccaagaaca tgaacgtgagaccactgctgaactacttcgagcccctgttcacctggctgaaggatcaaaacaagaacagcttcgtgggctggtccacagactgga gcccatacgctgatcagagcatcaaagtgaggatctctctgaagagcgccctgggagataaggcctacgagtggaacgataatgagatgtacctgt tcagaagcagcgtggcctacgccatgcggcagtacttcctgaaagtgaagaaccagatgatcctgtttggcgaggaggatgtgagagtggccaat ctgaaaccaagaatcagctttaactttttcgttaccgctcctaagaacgtgtctgatatcatccctagaaccgaggtggaaaaggccatcagaatgag ccggtccagaatcaacgatgccttccgactgaatgacaactccctggagttcctgggaatccagcccaccctgggccctcctaaccagcctccagt cagcggcggaggaggatctggcggtggaggctctggcggcggcggttcaacaaatctggtgaaccagagcggctacgccctggtggccagcg gcagatccggcaatctgggcttcaagctgttcagcacccagtctccatctgccgaggtgaagctgaagagcctgagccttaacgacggcagctac cagtccgagatcgacctgtcaggcggcgccaacttccgagaaaagttcagaaacttcgccaatgagctgagcgaggccatcacaaacagcccta aaggcctggacagacctgtgcccaagacggaaatcagcggcctgatcaagacaggcgacaactttatcacccctagcttcaaggccggatattat gaccacgtggcctctgatggctccctactgagctactaccagtccaccgagtacttcaacaacagagttctgatgcctatcctgcagacaacaaacg gcactctgatggccaacaaccggggctacgacgacgttttcagacaagtgccctctttcagcggctggagcaacacaaaggccaccactgtgtcc acaagcaacaatctgacatacgataagtggacctatttcgccgccaaaggcagccccctgtacgacagctaccccaaccacttcttcgaggacgtg aagacactggccattgacgctaaggacatcagcgccctgaaaaccaccatcgacagcgagaagcctacctacctgattatccggggactgagcg gaaacggcagccagctgaacgagctgcaactgcctgagtccgtgaaaaaggtgagcctgtacggcgactacaccggcgtgaacgtggctaagc agatcttcgccaacgttgtggaactggaattctacagcaccagcaaggctaactcttttggctttaaccccctggtcctgggatctaaaacgaacgtga tctacgacctgttcgcaagcaagcccttcacccacatcgacctgacacaggtgaccctgcaaaacagcgataattccgccatcgatgccaacaagc tgaagcaagctgtgggcgatatctacaactacaggcggttcgagagacagtttcagggctacttcgccggaggctacatcgacaagtacctggtga agaacgtcaataccaacaaggatagcgatgacgatctggtctaccggagcctgaaagagctgaacctccacctggaggaagcctacagagaagg cgataacacctactacagagtgaatgagaactattaccctggagctagcatctacgagaacgagagagccagcagagacagcgagttccagaac gagatcctgaagcgagccgagcagaacggcgtgacatttgacgagaacatcaaaagaatcacagccagcggcaagtatagcgtgcagttccaaa agctagaaaatgataccgattccagcctggaaagaatgaccaaggccgtggaaggccttgtgaccgtgatcggcgaggaaaagttcgagacagt ggatatcaccggcgtgtctagcgataccaatgaagtgaaaagcctggccaaggaactgaagaccaacgccctgggcgtcaagctgaaactctaa Artificial Protein M with N-terminal peptide Avi- and Myc-tags (aka: armY) codon-optimized (for human) nucleotide sequence (1,764 bp) 39) SEQ ID NO: 39 atgtacaggatgcaactcctgtcttgcattgcactaagtcttgcacttgtcacaaacagtatggctggtggcctgaatgacatctttgaggc ccagaagatcgagtggcatgagggaggagagcagaagctgatctccgaggaagatctgctgagaaagcgggccgccaacggcggaggagga tctggcggtggaggctctaccaatctggtgaaccagagcggatacgccctggtggcctctgggagaagcggaaatctgggatttaagctgttcagt acccagtctccaagcgctgaagtgaagctgaaaagcctctccctgaacgacggctcttatcagagcgagatcgacctgagcggcggcgctaactt ccgggagaagttccgcaacttcgctaatgagctgtctgaagccatcacaaacagccctaagggcctggatagacctgtgcccaagacagaaatca gcggcctgatcaagactggagataactttatcacccctagctttaaggccggctactacgaccatgtggctagcgacggttcactgctgtcctactac cagtctacagagtactttaacaaccgggtgctgatgcctatactgcagaccaccaacggcaccctgatggccaataacagaggctacgatgacgtg ttccggcaggtgcccagcttcagcggctggagcaacacaaaggccacaaccgtgagcacctccaacaacctgacctacgacaagtggacctact tcgccgccaagggctctccactgtatgacagctatcctaaccacttcttcgaggacgtgaagacactggccatcgacgccaaggacatctctgccct gaagaccaccatcgacagtgagaaacctacatacctgattatcagaggactgtccggcaacggcagccagctgaacgagcttcagctgcctgaga gcgtgaaaaaggtgagcctgtacggcgactacacaggcgtcaatgtagctaagcaaatcttcgccaacgtggtggaactcgaattctacagcacat ccaaggccaacagcttcggcttcaaccccctggtgctgggcagcaagaccaacgtgatctacgacctgttcgccagcaagcctttcacccacatcg acctgacacaagtgaccctgcagaacagcgataacagcgccattgatgccaacaagctcaaacaggccgtgggcgatatctacaactacagaag attcgagaggcagtttcagggctacttcgccggaggctatatcgataagtacctggtcaagaacgtgaacaccaacaaggactccgacgacgacct ggtgtaccggagcctgaaggaactgaacctgcacctggaagaggcctacagagagggcgataatacctactacagagtgaacgagaactactac cccggagctagcatctacgagaacgagagagcctctagagatagcgagttccagaacgagatcctgaagcgggccgagcagaatggcgtgaca ttcgacgagaacatcaagcggatcaccgccagcggcaagtactccgtgcagttccaaaaactggaaaatgacaccgacagcagcctggaaagaa tgaccaaggctgtggaaggcctggttacagttatcggcgaggagaagtttgaaaccgtggacatcaccggcgtgagctccgataccaatgaggtg aaatctctggccaaagaactgaagacaaatgccctgggcgtcaaattaaaactgtaa Artificial Protein M horseradish peroxidase (HRP) fusion protein with N-terminal Myc-tag codon- optimized (for human) nucleotide sequence (2,631 bp) 40) SEQ ID NO: 40 atgtacaggatgcaactcctgtcttgcattgcactaagtcttgcacttgtcacaaacagtgagcagaaactcatctcagaagaggatctgg cagcaaatcagctgaccccaaccttctacgacaattcttgtccaaacgtctccaacatcgtgcgggacaccattgtgaacgagctgagaagcgacc ctagaatcgccgcttctatcctgagactgcatttccacgactgcttcgtgaatggctgcgacgcctccatcctgctggacaacaccaccagcttccgg acagagaaagacgccttcggaaatgccaacagcgctagaggcttccccgttatcgacagaatgaaggctgccgtggaatctgcctgccctcggac cgtgagctgtgccgacctgctgaccatcgccgcccagcagagcgtgaccctggccggcggtcctagctggcgggtgcctctgggccggagaga tagtctgcaggccttcctggatctggctaatgctaacctccccgctcctttctttaccctgcctcagctgaaggacagctacggaacgtcggcctaaa cagaagcagcgacctggtggccctgtccggaggccacaccttcggcaagaaccagtgcagattcatcatggaccggctgtacaacttcagcaata ccggcctgccagatcctacactgaacacaacctacctgcagacactgagaggcctgtgccccctcaacgggaatctgagcgccttggtggacttc gacctgagaacccctaccatcttcgacaacaagtactacgtgaacctggaagaacagaagggcctgatccaaagcgatcaggagctgttctcttcc cctaatgccacagacaccatccccctggtgcggtcattcgccaacagtacccagaccttttttaacgcttttgtggaagccatggatagaatgggcaa catcacccctctgaccggaacacagggacagatcagactgaattgcagagtggtgaacagcaactctggcggaggaggatctggcggtggagg ctctggcggcggcggttcaacaaatctggtgaaccagagcggctacgccctggtggccagcggcagatccggcaatctgggcttcaagctgttca gcacccagtctccatctgccgaggtgaagctgaagagcctgagccttaacgacggcagctaccagtccgagatcgacctgtcaggcggcgccaa cttccgagaaaagttcagaaacttcgccaatgagctgagcgaggccatcacaaacagccctaaaggcctggacagacctgtgcccaagacggaa atcagcggcctgatcaagacaggcgacaactttatcacccctagcttcaaggccggatattatgaccacgtggcctctgatggctccctactgagct actaccagtccaccgagtacttcaacaacagagttctgatgcctatcctgcagacaacaaacggcactctgatggccaacaaccggggctacgac gacgttttcagacaagtgccctctttcagcggctggagcaacacaaaggccaccactgtgtccacaagcaacaatctgacatacgataagtggacc tatttcgccgccaaaggcagccccctgtacgacagctaccccaaccacttcttcgaggacgtgaagacactggccattgacgctaaggacatcagc gccctgaaaaccaccatcgacagcgagaagcctacctacctgattatccggggactgagcggaaacggcagccagctgaacgagctgcaactg cctgagtccgtgaaaaaggtgagcctgtacggcgactacaccggcgtgaacgtggctaagcagatcttcgccaacgttgtggaactggaattctac agcaccagcaaggctaactcttttggctttaaccccctggtcctgggatctaaaacgaacgtgatctacgacctgttcgcaagcaagcccttcaccca catcgacctgacacaggtgaccctgcaaaacagcgataattccgccatcgatgccaacaagctgaagcaagctgtgggcgatatctacaactaca ggcggttcgagagacagtttcagggctacttcgccggaggctacatcgacaagtacctggtgaagaacgtcaataccaacaaggatagcgatgac gatctggtctaccggagcctgaaagagctgaacctccacctggaggaagcctacagagaaggcgataacacctactacagagtgaatgagaact attaccctggagctagcatctacgagaacgagagagccagcagagacagcgagttccagaacgagatcctgaagcgagccgagcagaacggc gtgacatttgacgagaacatcaaaagaatcacagccagcggcaagtatagcgtgcagttccaaaagctagaaaatgataccgattccagcctggaa agaatgaccaaggccgtggaaggccttgtgaccgtgatcggcgaggaaaagttcgagacagtggatatcaccggcgtgtctagcgataccaatg aagtgaaaagcctggccaaggaactgaagaccaacgccctgggcgtcaagctgaaactctaa Artificial armY-Angiotensin-converting enzyme 2 (ACE2) fusion protein codon-optimized (for human) nucleotide sequence (3,837 bp) 41) SEQ ID NO: 41 atgtacaggatgcaactcctgtcttgcattgcactaagtcttgcacttgtcacaaacagtcagagcacaatcgaggaacaggccaagac cttcctggacaagttcaaccacgaagctgaagacctgttctaccaatctagcctggctagttggaactacaacaccaacattacagaagagaacgtg cagaacatgaacaacgcaggcgacaagtggtccgccttccttaaagagcagtctacactggcccagatgtaccctctgcaagagattcagaatctg accgtgaagctgcagctgcaggctctccagcagaatgggtccagcgtgctgtctgaggataagagcaagcggctgaacaccatcctgaatacaat gagcaccatctacagcaccggcaaagtgtgtaaccctgacaacccccaggagtgtctgctgctggaacctggcctgaacgaaatcatggccaact ccctggactacaacgagagactgtgggcctgggagagctggcgtagcgaggtgggaaaacagctgcgccccctgtatgaggagtacgtggtgc tgaagaatgagatggccagagccaaccactacgaggactacggcgactattggagaggcgattatgaagtcaacggcgttgacggctacgacta cagccggggacagctgatcgaagacgtggaacatacgtagaggagatcaagcctctgtacgagcacctgcacgcctacgtaagagccaaactg atgaatgcctaccccagctacatctcccctatcggctgcctgcccgcccatctgctcggcgacatgtggggcagattctggaccaacctgtattctct gacagtgcctacggccagaaacctaacatcgacgtgacagatgccatggtggaccaggcctgggatgcccaaagaatcttcaaggaagccgag aaattcttcgtgtccgtggggctgcctaatatgacccagggcttctgggaaaacagcatgctcaccgatcctggcaacgtgcagaaggcagtgtgc caccccaccgcctgggaccttggaaagggcgacttccggattctgatgtgcaccaaggtgaccatggacgacttcctgaccgctcaccacgagat gggccacatccagtacgacatggcctacgccgctcagcctacctcctgagaaacggcgctaatgaaggcttccacgaggccgtgggcgaaatca tgagcctgagcgccgccacccctaagcacctgaagtctatcggactgctgagccccgactacaggaggacaacgaaactgagatcaacttcttgc tgaaacaggccctgacaatcgttggcaccctgccattacctacatgctggaaaagtggagatggatggtattaagggcgaaatccccaaggacc aatggatgaagaagtggtgggagatgaagcgggaaatcgtgggcgtggtggaacctgtgccccacgacgagacatactgcgatcctgctagcct ctttcacgtgagcaatgattactcattcatccggtactacaccagaactctgtaccagttccagttccaggaggccctgtgccaggccgccaagcacg agggccctctgcacaagtgcgacatctctaacagcaccgaggccggccagaagctgttcaacatgctgagactgggcaagagcgaaccttggac actggccctggagaacgtggtcggagccaagaacatgaacgtgagaccactgctgaactacttcgagcccctgttcacctggctgaaggatcaaa acaagaacagcttcgtgggctggtccacagactggagcccatacgctgatcagagcatcaaagtgaggatctctctgaagagcgccctgggagat aaggcctacgagtggaacgataatgagatgtacctgttcagaagcagcgtggcctacgccatgcggcagtacttcctgaaagtgaagaaccagat gatcctgtttggcgaggaggatgtgagagtggccaatctgaaaccaagaatcagctttaactttttcgttaccgctcctaagaacgtgtctgatatcatc cctagaaccgaggtggaaaaggccatcagaatgagccggtccagaatcaacgatgccttccgactgaatgacaactccctggagttcctgggaat ccagcccaccctgggccctcctaaccagcctccagtcagcggcggaggaggatctggcggtggaggctctggcggcggcggttcaacaaatct ggtgaaccagagcggctacgccctggtggccagcggcagatccggcaatctgggcttcaagctgttcagcacccagtctccatctgccgaggtga agctgaagagcctgagccttaacgacggcagctaccagtccgagatcgacctgtcaggcggcgccaacttccgagaaaagttcagaaacttcgc caatgagctgagcgaggccatcacaaacagccctaaaggcctggacagacctgtgcccaagacggaaatcagcggcctgatcaagacaggcg acaactttatcacccctagcttcaaggccggatattatgaccacgtggcctctgatggctccctactgagctactaccagtccaccgagtacttcaaca acagagttctgatgcctatcctgcagacaacaaacggcactctgatggccaacaaccggggctacgacgacgttttcagacaagtgccctctttcag cggctggagcaacacaaaggccaccactgtgtccacaagcaacaatctgacatacgataagtggacctatttcgccgccaaaggcagccccctgt acgacagctaccccaaccacttcttcgaggacgtgaagacactggccattgacgctaaggacatcagcgccctgaaaaccaccatcgacagcga gaagcctacctacctgattatccggggactgagcggaaacggcagccagctgaacgagctgcaactgcctgagtccgtgaaaaaggtgagcctg tacggcgactacaccggcgtgaacgtggctaagcagatcttcgccaacgttgtggaactggaattctacagcaccagcaaggctaactcttttggct ttaaccccctggtcctgggatctaaaacgaacgtgatctacgacctgttcgcaagcaagcccttcacccacatcgacctgacacaggtgaccctgca aaacagcgataattccgccatcgatgccaacaagctgaagcaagctgtgggcgatatctacaactacaggcggttcgagagacagtttcagggcta cttcgccggaggctacatcgacaagtacctggtgaagaacgtcaataccaacaaggatagcgatgacgatctggtctaccggagcctgaaagagc tgaacctccacctggaggaagcctacagagaaggcgataacacctactacagagtgaatgagaactattaccctggagctagcatctacgagaac gagagagccagcagagacagcgagttccagaacgagatcctgaagcgagccgagcagaacggcgtgacatttgacgagaacatcaaaagaat cacagccagcggcaagtatagcgtgcagttccaaaagctagaaaatgataccgattccagcctggaaagaatgaccaaggccgtggaaggcctt gtgaccgtgatcggcgaggaaaagttcgagacagtggatatcaccggcgtgtctagcgataccaatgaagtgaaaagcctggccaaggaactga agaccaacgccctgggcgtcaagctgaaactctaa Artificial armY-CD209 (DC-SIGN) fusion protein codon-optimized (for human) nucleotide sequence (2,706 bp) 42) SEQ ID NO: 42 atgtaccgaatgcagctgctgtcttgtattgccctgtccctggccctggttaccaattctcaagtgagcaaggtgcccagcagcatctctc aggagcagagcagacaggacgccatctaccagaacctgactcaactgaaggcggctgtgggcgaactgagcgagaagtctaagctgcaggag atctatcaggaactgacacaactgaaggctgccgtgggggaattacccgagaagagcaagctgcaggaaatctaccaggagctgaccagactca aagccgccgtgggcgagctgccagagaagtctaaactgcaggaaatctaccaggaattgacatggctgaaggcagctgttggcgagctgcctga gaaaagcaagatgcaggagatttaccaggagctcacacggctgaaggccgccgtcggcgaactccccgagaaaagcaagcagcaggagatct accaggagcttacaagacttaaggccgctgtgggagagctgcctgagaagtccaaacaacaggaaatctaccaagaactgaccagactgaaagc cgccgtgggagaactgccagaaaaaagcaagcagcaggagatctaccaagaactgacacagcttaaagcagctgttgagcggctgtgtcaccca tgcccttgggagtggacattcttccagggcaactgctacttcatgagcaatagccaaaggaactggcacgacagcatcacagcctgcaaggaagt gggggcccagctggtggtgatcaagtccgccgaagaacaaaatttcctgcagctgcagtcctccagaagcaacagattcacatggatgggcctgt cagacctgaaccaagaaggcacctggcagtgggtcgatggcagccccctgctgccctattcaagcagtactggaaccgcggcgagcctaacaa tgtgggcgaggaagattgcgccgagtttagcggcaacggctggaatgacgacaagtgcaacctcgccaagttctggatctgtaaaaagtccgccg cctcctgcagccgcgacgaggagcagtttctgtcccctgcccccgccacccctaatcctcctcccgccggcggtggcggaagcggcggcggcg gcagcggaggaggcggcagcaccaacctggtgaatcagagcggctacgccctggtggcctctggtagatctggcaacctgggattcaagctgtt cagcacacagtctcctagtgccgaagtgaagctgaagtcactgagcctgaacgacggcagctaccagagcgaaatcgacctgtctggcggtgcta acttcagagagaagttccggaacttcgccaacgagctgtccgaggccattaccaacagtcccaagggcctggaccggcctgtgcctaagaccga gatcagcggcctgatcaagaccggcgacaacttcatcacccctagattaaggctggctactacgaccacgtggcctccgatggctctctgctgtcc tattatcagagcacagagtacttcaacaatagagtgctgatgcctatcctgcaaacaaccaacggcaccctgatggccaataataggggatacgac gacgtattcggcaggtgcctagcttctccggctggagcaacaccaaggccacaaccgtgtctacaagcaacaacctgacatacgacaagtggac ctactttgccgccaaggggagccctctgtacgactcttatcctaatcatttcttcgaggacgtgaagaccctggccatcgatgccaaggatatcagcg ccctgaagaccaccatcgacagcgaaaaacccacctacctgatcatccggggcctgagcggcaatggcagccagctgaacgaactgcagctgc cagaaagcgtgaagaaggtgtctctgtacggcgactacaccggcgtgaacgtggctaagcagatcttcgccaatgttgttgagcttgagttctacag cacgagcaaggccaactcattcggatcaaccccctggtgctgggaagtaagacaaacgtgatctatgacctgtttgccagcaaacctttcacccac atcgacctgacccaggtgaccctgcagaacagcgacaacagcgccattgatgctaacaagctgaaacaggccgtgggagacatctacaactacc ggagattcgagagacagttccaaggctacttcgccggcggctatatcgataagtacctggtgaaaaacgtgaacaccaacaaggatagcgatgac gacctggtgtacagaagcctgaaggaactgaacctgcacctggaggaagcctacagagaaggcgataacacatactacagagtgaacgagaact actaccctggagccagcatctacgagaacgagagagcctctcgggactccgagttccagaacgaaatcctgaaacgggccgagcagaacggcg tgacatttgatgaaaacatcaagagaatcaccgctagcggcaagtacagcgtgcagtttcagaagctggagaacgacactgattctagcctggaaa gaatgaccaaggcggtcgagggcctggtgaccgtgatcggcgaggagaagttcgaaaccgtggacatcaccggcgtgtccagcgacaccaatg aggtgaaatctctggccaaagagctgaagaccaacgccctcggagtgaagctgaagctgtaa Artificial armY-C-type lectin domain family 4 member M fusion protein codon-optimized (for human) nucleotide sequence (2,655 bp) 43) SEQ ID NO: 43 atgtaccggatgcagctgctgtcttgtatcgccctgagcctggccctggtcaccaattctcaggtgtctaaggtgccttctagcctgagcc aggagcagtctgagcaggacgctatctaccagaacctgacacagcttaaggccgctgtgggcgaactgtcagaaaagtctaagctccaagagatc taccaggagcttacacagctgaaagccgccgtgggcgagctgcctgagaagtccaagttgcaagagatctaccaggagctgacccggctgaaag ccgccgtgggagagctgcccgagaagagcaaactgcaggaaatctatcaggagctgaccagactgaaggccgccgtgggagagctgcccgag aaatccaagctacaggagatctaccaggagctgacaagactgaaggccgcagtgggcgagctgccagaaaagagcaagctgcaggagatctac caggaactgacagagctgaaggccgccgttggagaactgcctgaaaagtccaaactgcaggaaatctatcaggagctgacacagctgaaggctg ccgtgggcgaactccctgaccagtccaagcagcagcagatttaccaggaactgaccgacctgaaaacagccttcgagagactgtgtagacactgc cctaaggactggacattcttccagggcaactgctacttcatgagcaacagccagcggaactggcacgacagcgtgaccgcctgtcaggaggtgc gggcccagctggtggtcatcaagaccgccgaagagcaaaacttcctgcagctgcaaacaagcagaagcaacagattcagctggatgggcctga gcgatctgaaccaggagggcacctggcagtgggtggatggaagccctctgtctccaagcttccaaagatactggaacagcggagagcctaacaa ctctggaaatgaggactgcgccgagttcagcggttctggctggaatgacaacagatgcgacgtggacaactactggatctgcaagaaacccgccg cctgcttccgagatgagggcggtggcggaagcggcggcggaggcagcggaggcggcgggagtaccaacctggtgaatcagagcggctacgc cctggtcgcctcgggcagatccggcaatctgggcttcaagctgttcagcacacaaagcccttctgctgaagtgaaactgaagagcctgagcctgaa tgatggctcttaccagagcgagatcgacttatccgggggagccaactttcgggaaaaattcagaaacttcgctaacgagctgagcgaggccatcac caactcccccaagggcctggatagacctgtgcccaagacagagatcagcggcctgatcaagaccggcgataacttcatcacccctagctttaagg ccggatactacgaccacgtggcttccgatggcagcctgctgagctactaccagagcaccgagtacttcaacaacagagtactgatgcctatcctgc agacaacaaatggcaccctgatggccaacaataggggctacgatgacgtgttcagacaggttccttcattcagcggctggagcaatacgaaggct acaaccgtgtcgaccagcaacaacctgacctatgacaagtggacctacttcgccgctaagggcagccctctgtacgacagctaccccaaccacttc ttcgaggatgtgaaaaccctggccattgacgccaaggacatcagcgccctgaaaaccaccatcgacagcgagaagcctacatacctgatcatcag aggcctgtcaggcaacggctcccagctgaacgaactgcaactgccagagagtgttaagaaggtgagcctgtacggcgactatacaggagtgaac gtggctaagcagatcttcgctaatgtggtggaactggaattctacagcaccagcaaagccaacagcttcggctttaaccccctggtgctgggcagca agaccaacgtgatctacgaccttttcgccagcaagcccttcacccacatcgacctgacccaggtgaccctgcagaatagcgacaattctgccattga cgccaacaagctgaaacaggccgtgggcgatatctacaactacaggcggttcgaaagacagttccaaggctattttgccggcggctacatcgaca agtacctggtcaagaacgtgaacaccaacaaggattccgacgacgatctagtgtaccggagcttgaaggaactcaacctgcatctggaagaggcc tacagagaaggcgacaacacatactaccgcgtgaacgagaactactaccctggcgccagcatctacgagaacgaacgggcttctagagatagcg agtttcagaatgaaatcctgaagagagccgaacagaacggcgtgaccttcgacgagaacattaagcggatcacagcctctggcaagtacagcgtg cagtttcagaagctggaaaacgacaccgacagctctctcgagagaatgaccaaggccgttgagggcctggtgacagtgatcggcgaggaaaagt tcgaaaccgtggacatcaccggcgtgtcctctgataccaacgaggtgaagagcctggcaaaggaactgaagaccaacgccctgggcgtgaagct gaagctgtaa Artificial armY-CD4 fusion protein codon-optimized (for human) nucleotide sequence (2,781 bp) 44) SEQ ID NO: 44 atgtacagaatgcagctgctgagctgcatcgccctgtccctggccctggttacaaacagcaagaaggtggtgctgggaaaaaagggc gacaccgtggaactgacctgcaccgctagccagaagaagagcatccaatttcactggaagaacagcaaccagatcaaaatcctggggaaccagg gctctttcctgacaaagggcccctctaagctgaatgatagagccgacagccggagatcgctgtgggaccagggcaacttccccctgatcatcaaga acctgaagatcgaggatagtgacacatacatctgcgaggtggaagatcagaaggaagaggtgcaactgctggtgttcggactgaccgccaacag cgacactcacctgctgcagggccagtctctcacactaaccctggaaagccctcctggaagctctccaagcgtccagtgtagatctcctagaggcaa gaacatccagggcggcaagaccctactgtgtctcagctggagctgcaggactcaggcacctggacatgtaccgtactgcaaaatcagaaaaagg tggaattcaagatcgacatcgttgtgctggccttccagaaggccagcagcatcgtgtacaagaaggaaggagagcaggtggagttttctttccctct cgcctttaccgtggaaaaactgaccggttcaggcgagctgtggtggcaggccgagcgcgcaagctccagcaagagctggatcacattcgacctta agaacaaagaggtgagcgtgaagagagtgacccaggaccccaagctgcagatgggcaagaagctgcccctgcacctgaccctcccgcaagcc ctgcctcagtacgccggatccggcaacctgacactggccctcgaagccaaaaccggaaagctgcaccaggaggtgaacctggtggtgatgaga gccacccagctgcagaaaaatctgacctgcgaagtgtggggccctacaagccctaagctcatgctgagtcttaaactggagaacaaggaggctaa agtgagcaagcgggaaaaggccgtgtgggtgctgaatcctgaggccggcatgtggcagtgcctgctgtctgacagcgggcaagtgctgctggaa tctaacatcaaggtcctgcccacctggtccacccctgtgcagccaggcggcggaggatctggcggcggcggcagcggaggcggcggctccac caacctggtgaatcagagcggctacgccctggtggctagcggtagatccggcaatctgggattcaagcttttctccacacagagccctagcgccga agtgaagttgaaatctctgagcctgaacgacggctcctaccagtccgagatcgacctgagcggcggcgctaatttcagagagaagtacggaactt cgccaatgagctgtctgaagctatcaccaacagccctaaaggacttgatcgcccagtgcccaagaccgagattagcggcctgatcaagacaggcg ataactttatcacccctagtttcaaggctggctattatgaccacgtggccagcgacggaagcctgctgagctactaccagagcacagagtacttcaac aaccgggtgctgatgcctatcctgcagaccaccaacggcacgctgatggccaacaacagaggctacgacgacgtgttccggcaggtgcctagctt tagcggatggagcaacaccaaggctacaactgtgagcaccagcaacaacctgacctacgataagtggacctacttcgccgccaaaggcagccct ctgtacgatagctaccctaaccacttcttcgaggacgtgaagacactggctatcgacgccaaggacattagcgccctgaaaaccacaattgactctg aaaagcccacctacctgatcatcagaggactgagcggcaacggcagccagctgaacgagctgcagctgcctgaatctgtgaaaaaagtcagcctt tacggcgactacaccggcgtgaacgtggccaagcagatcttcgccaatgtggtggaactggagttctacagcacctctaaagccaacagtacggc ttcaaccccctggtgctgggctctaaaaccaatgtaatttatgacctcttcgctagcaagcctacacacacatcgatctgacccaggtgacactgcag aactctgacaacagcgccatcgatgccaataagctgaagcaggccgtgggcgacatctacaactaccggagattcgagagacagtacagggcta ctttgccggcggctacatcgataagtacctggttaagaacgtgaataccaacaaggactctgatgacgacctggtgtacagaagcctgaaggaact gaacctgcatctggaagaggcctacagagaaggcgacaacacctactatcgggtgaatgagaactactatcccggcgcttctatctacgagaatga gcgggccagcagagatagtgagttccaaaatgagatcctgaagcgggcagagcaaaacggcgtgaccttcgacgagaacatcaagagaatcac cgcctccggcaaatacagcgtgcagttccagaaactggaaaacgacactgatagcagcctggaacggatgaccaaggccgtagagggcctggt caccgtgatcggcgaggagaagtttgagacagtggacatcacaggcgtgagctccgataccaacgaggtgaagagcctggccaaggaactgaa gaccaacgccctgggagtgaagctgaagctataa Artificial armY-Synaptic vesicle glycoprotein 2A fusion protein codon-optimized (for human) nucleotide sequence (2,058 bp) 45) SEQ ID NO: 45 atgtacagaatgcagctgctgtcatgcatcgccctctccctcgccctggtgaccaacagccccgacatgatcagacacctgcaggccg tcgactacgccagcagaaccaaagtgttccccggagaacgggtggaacacgtgacatttaacttcaccctggaaaaccagatccacagaggcgg ccagtacttcaacgacaagttcatcggcctgagactgaagtccgtgtccttcgaggatagcctgtagaggaatgctactagaggacgtgacatcta gcaatacctttttccggaactgcacattcatcaacaccgtgttctacaacaccgatctgtttgaatacaagttcgtgaacagcagactgatcaacagca cctactgcacaacaaggagggctgtcctttagatgtgaccggaacgggcgagggcgcctacatggtgtacggcggcggaggctccggcggcg gtggcagcggtggaggaggcagcaccaatctggtcaaccaatctggctatgccctggtcgccagtggcagaagcgggaacctgggcttcaagct gttcagcacacagagccctagcgctgaagtgaaactgaagagcctgtctctgaacgacggctcttatcagagcgagatcgacctgtccggaggcg ccaatttcagagagaagttcaggaacttcgccaacgagctgagcgaggccatcaccaattcccctaagggactggatagacctgtgccaaaaacc gagattagcggcctgattaagaccggagataatttcatcacacccagctttaaggccggatattacgaccacgtggcctctgacggcagcctgctga gctactaccagagcaccgagtacttcaacaaccgggtgctgatgcctatcctgcaaacaacaaatggcacactgatggccaacaaccggggatat gacgacgtgttccgccaggtgcccagcttcagcggctggagcaacacaaaggctacaaccgtgtctaccagcaacaacctgacctacgataagtg gacctacttcgccgctaaaggcagccctctgtacgacagctaccccaaccacttcttcgaggacgtcaagaccctggcgatagacgccaaagaca tcagcgctctgaagaccaccatcgacagcgaaaagccaacatacctgatcatcagaggcctgagcggcaacggctcacagctgaacgagctgca gctgcctgagagcgtgaaaaaggtgtcactgtacggcgattacaccggcgtgaacgtggccaagcagatcttcgcaaacgttgtggaactggaatt ctactctacaagcaaggccaacagcttcggctttaatcctctggtgctggggtctaagacaaacgtgatctacgacctgttcgccagtaagcctacac ccacatcgacctgacccaggttacactgcagaactccgacaacagcgccatcgacgccaacaagctgaaacaggccgtgggcgacatctacaac tacaggagattcgaaagacagttccagggctattttgccggcggctacatcgacaagtacctggtgaagaacgtgaataccaacaaggactctgat gacgatctcgtgtaccggagcctgaaggaactgaatctgcatctggaagaagcttaccgggaaggcgacaatacctactacagagtgaacgagaa ctactaccctggcgctagcatctacgagaacgaacgggccagcagagattctgagttccaaaacgagatcctgaagcgggccgagcagaatggc gtcaccttcgacgagaacatcaagagaatcaccgcctctggcaaatacagcgtgcagttccaaaaactggaaaacgatactgatagctcccttgag agaatgaccaaggccgtggaaggactggtgaccgtgatcggcgaagagaagttcgagacagtggacatcacaggcgtgtccagcgataccaat gaggtgaagagcctggccaaggagctgaaaaccaacgccctcggcgtgaagctgaagctgtaa Artificial armY-Synaptic vesicle glycoprotein 2B fusion protein codon-optimized (for human) nucleotide sequence (2,040 bp) 46) SEQ ID NO: 46 atgtacagaatgcagttgctgtcttgtatcgccctcagcctggctctggtgacgaatagcccagacatgatccgctacttccaggacgag gaatacaagagcaagatgaaggtgttctaggcgagcatgtgtacggcgccaccatcaacttcaccatggaaaaccagatccaccagcacggcaa gctggttaatgacaagtttacaagaatgtactttaagcacgtgctgttcgaggataccttttttgatgagtgctacttcgaggacgtgacaagcaccgac acatacttcaagaactgcaccatcgagagcaccatcttctacaacaccgacctgtatgagcacaagttcatcaactgcagatttatcaacagcaccttc ctggaacagaaagagggctgccacatggacctggaacaagacaatgatggaggcggaggaagcggcggcggaggcagcggcggcggggg aagcaccaatctggtgaatcaaagcggctacgccctggtggctagcggcagaagcggcaacctgggcttcaagctgtttagcacacagagcccta gcgctgaagtgaagctgaagtctctctctctgaatgacggctcctaccagtctgagatcgacctcagcggaggcgccaacttcagggaaaagttcc ggaacttcgccaacgagctgagcgaggccattacaaacagccctaagggcctggacagacctgtgcccaagaccgagatcagcggcctgatca agactggagataattttattacccctagcttcaaggcaggctactacgaccacgtggcctccgatggctctctgctgtcctattatcagagcacagagt actttaacaacagagtgctgatgcctatcctgcagaccacaaacggcaccctgatggccaacaatagaggctatgatgatgtgttcagacaggtgcc ttctttcagcggatggtccaacacaaaggccacaacagtttctacaagcaacaacctgacctacgataagtggacatacttcgccgccaagggctct ccactgtacgacagctaccctaaccacttcttcgaagatgtgaagaccctggccatcgacgccaaggacatcagcgcccttaaaacaaccattgac agcgagaagcctacctacctgatcatcagaggactgagcggaaacggctcccagctgaacgaactgcaactgcctgagtctgtgaaaaaggtga gcctgtacggcgattacaccggcgttaacgtggctaaacagatcttcgccaacgtggtggaactggagttctacagcaccagcaaggccaatagct tcgggttcaaccccctggtccttggctccaaaaccaacgtcatctacgacctgttcgcttctaagcccttcacacacatcgacctgacccaggttaccc tgcagaactcagacaacagtgctatcgacgccaacaaactgaagcaggccgtgggcgatatctataactaccggagattcgagcggcagttccaa ggctacttcgccggcggatatatcgacaagtacctggtcaagaacgtgaacaccaacaaggacagcgatgacgacctggtgtaccggagcctga aggaactgaacctgcacctggaagaagcctaccgggaaggcgacaacacctactaccgggtgaacgagaattactaccccggcgctagcatcta cgagaacgagagagcctccagagattcagagttccagaacgagatcctgaaaagagccgagcagaatggcgtgaccttcgacgagaacatcaa gcggatcacagcctctggcaaatacagcgtgcagttccagaagctggaaaatgataccgatagcagcctggaaagaatgaccaaggcggtggaa ggcttggtcaccgtgatcggcgaggagaagttcgagacagtggacatcaccggcgtgtccagcgacaccaacgaggtgaaaagcctggccaag gaactgaagaccaacgccctgggcgtgaagctgaagctgtaa Artificial armY-Synaptic vesicle glycoprotein 2C fusion protein codon-optimized (for human) nucleotide sequence (2,028 bp) 47) SEQ ID NO: 47 atgtaccgcatgcagctgctgagctgcatcgccctgagcctggctctggtgacaaacagcaaacctctgcagagcgacgagtacgcc ctgctgacaagaaacgtcgagcgggacaagtacgccaattttaccatcaactttaccatggaaaaccagatccacaccggaatggaatacgataat ggcagattcattggcgttaagttcaaaagcgtgacattcaaagatagcgtgttcaagagctgtacattcgaagatgtgaccagcgtaaatacctacttc aaaaactgcaccttcatcgacaccgtgttcgacaacaccgatttcgagccttacaagttcatcgacagcgagttcaagaactgcagctttttccacaac aaaaccggatgtcagatcaccttcgacgacgactacagcggcggcggcggctcgggcggaggaggctctggtggcggcggcagcacaaacct ggtcaaccagagcgggtatgccctggtggccagcggcagaagcggcaatctgggcttcaagctgttcagcacacagtccccaagcgctgaggtg aagctcaaatctctgtcccttaacgacggcagttaccaaagcgagatcgacctgagcggcggagccaacttccgggaaaagttcagaaatttcgct aatgaactgagcgaggccatcacgaatagccctaagggcctggatagacccgtgcccaagactgagatcagcggcctgattaagacaggagata acttcatcacacctagcttcaaggccggctattacgaccacgtggcctcagacggctccctgctgagctactaccagagcacagagtacttcaacaa ccgggtgctgatgcctatcctgcagaccaccaacggaacactgatggccaacaacagaggctatgacgatgtgtttagacaggtcccctcttttagc ggatggtccaacaccaaggctacaacagtgtccaccagcaacaacctgacctacgacaagtggacatatttcgccgccaagggaagccctctgta cgacagctacccaaaccacttcttcgaggacgtgaagaccctggccattgacgccaaagacatcagcgccctgaagaccacaatcgattctgaga aacctacctatctgatcatcagaggactctctggcaacggcagccagctgaacgagctgcagctgcctgagagcgtgaaaaaggtgtccctgtac ggcgattacaccggcgtgaacgtggccaagcagatcttcgccaacgtggtggaacttgagttctacagcaccagcaaggccaattctttcggcttca accccctggtcctgggcagcaagacaaatgtgatctacgacctgttcgcctctaagcctttcacccacatcgacctgacccaggtgacactgcaaaa ttccgataacagcgccatcgacgctaacaagctgaagcaggccgtgggcgacatctacaactaccggcggtttgagcggcagtttcagggctactt tgctggcggatacatcgacaagtacctggtgaagaacgtgaacacaaacaaggactctgatgacgacctggtttaccggtctctgaaggaactgaa cctccatctggaagaagcctacagagaaggcgacaacacctactacagggtgaacgagaactactaccccggcgctagcatctacgagaacgaa agagcctctagagatagcgaatttcagaacgagatcctgaagagagctgaacagaatggcgtgacctttgatgagaacatcaagcggatcaccgc ctccggcaagtacagcgtgcagttccaaaagctggagaatgataccgactccagcctggaaagaatgaccaaggcagtggagggcctggtgacc gtgatcggcgaggaaaagttcgagacagtggacatcaccggcgttagcagcgacaccaacgaggtgaagtctctggccaaggaactgaagacc aacgccctgggagtgaaactgaagctgtaa Artificial armY-Synaptotagmin I fusion protein codon-optimized (for human) nucleotide sequence (1,839 bp) 48) SEQ ID NO: 48 atgtacagaatgcagctgctgagctgcatcgccctgagcctggccctggttacaaacagcatggtgtccgagagccaccacgaggcc ttagcagctcctcctgtgaccaccgtggctacagtgctgcccagcaatgccaccgagcctgccagccctggagagggaaaagaggacgcctttag caagctgaaggagaagttcatgaacgagctgcataagatccctctgcctggaggtggcggcagcggaggaggtggctccggcggcggcggctc caccaacctggtgaaccagagcggctacgccctggtggccagcggaagaagcggcaacctgggcttcaagctgttttctacgcagagccccagc gccgaagtgaagctgaagagcctgtcactgaacgacggcagctatcagtctgagatcgacctgtctggcggggccaatttcagagagaaatttag aaacttcgctaatgagctgagcgaggccatcaccaactcgcccaagggcctggacagacctgtgcccaagaccgaaatcagcggcctgattaaa acaggcgataacttcatcaccccttcttttaaggctggctactacgaccacgtggccagcgatggcagcctgctgtcttactaccagagcacagagt actttaacaacagagtgctgatgcctatcctgcagaccaccaacggaacactgatggccaacaaccggggctacgacgacgtcttcagacaggtg cctagcttctctggctggtccaacaccaaggcgacaaccgtgtccaccagcaacaatctgacatacgataagtggacctacttcgctgccaagggc tccccactgtacgactcttatccaaaccacttcttcgaggatgtgaaaactctggctatcgacgccaaggacatcagcgctctgaagaccacaatcga cagcgaaaagcccacctacctgatcatcagaggactgagcggaaatggctcacagctgaacgaactgcagctgcctgagtctgtgaagaaggtgt ccctctacggcgactacaccggcgtcaacgtggccaagcaaatcttcgccaatgtggtggaactggaattctacagcaccagcaaggccaacagc ttcggcttcaaccccctggtgctggggagcaaaacaaacgtgatctatgacctgttcgccagcaagcctttcacccacatcgatctgacccaagtga ccctgcagaacagcgataatagcgccatcgacgccaacaagctcaagcaggccgtgggcgatatctacaactacaggcggttcgagagacagtt tcagggctacttcgccggcggctacatcgacaaatacctggtcaagaacgtgaacaccaacaaagactctgatgacgacctggtctaccggagcc tgaaagagcttaatctgcacctggaagaggcctaccgggaaggcgacaacacatactacagagtgaacgagaactactacccaggcgccagtatt tacgagaacgaacgcgcctctagagatagcgagttccaaaatgagattttaaaaagagccgagcagaacggcgtgacattcgacgagaacatcaa gcggatcaccgcctccggcaagtacagcgtgcagttccagaagctggaaaatgataccgacagcagcctggaacggatgaccaaggccgtgga aggcctggtgaccgtgatcggcgaggaaaagttcgaaaccgtcgacatcacaggcgtgtctagcgacaccaatgaggtgaagagccttgctaag gaactgaagacaaacgccctgggcgtgaaactgaagctgtaa Artificial armY-Synaptotagmin II fusion protein codon-optimized (for human) nucleotide sequence (1,854 bp) 49) SEQ ID NO: 49 atgtaccggatgcagctgctgagctgcatcgccctgtccctggccctggtgacaaacagcatgagaaacattttcaagagaaaccagg agcctatcgtggcccctgctacaaccacagccacaatgcctatcggccctgtggataattcgactgaaagcggcggagccggcgagtcccaaga agatatgttcgccaagctgaaagagaaactgttcaacgagatcaacaagatccccctgcctccaggcggcggcggcagcggaggaggcggcag cggtggcggcggcagcacaaatctggtaaaccagagcggctacgccctggttgcctccggaagaagcggaaacctgggatttaagctgttcagc acccagtccccatctgctgaagtgaaactgaagagcctgagcctgaatgacggctcttaccagagcgagatcgacctgagtggaggcgccaattt cagagagaaattccgcaacttcgccaatgagctgagcgaggccatcaccaacagccctaagggcctggacagacctgtgcccaagaccgaaatc agcggactgatcaagaccggcgacaacttcatcaccccttcttttaaggctggatattacgaccacgtggcctctgacggatctctgctgagctacta ccagtctaccgagtacttcaacaaccgggtgctgatgccaattcttcagacaaccaacggcaccctgatggccaacaatagaggctacgacgatgt gttccggcaagtgcctagcttttctggctggagcaacaccaaggccaccaccgtgtccaccagcaacaacctcacctatgataagtggacctacttt gctgctaaaggcagccccctgtacgactcttatcctaaccacttcttcgaagatgtgaagaccctggctatcgatgccaaggacatcagcgccctga aaaccaccatcgacagcgagaagcccacctacctgatcatcagaggcctatctggcaacggcagccagctgaacgagctgcagctccctgagag cgtgaagaaggtgtctctgtacggcgattacaccggcgttaatgtggctaaacagatcttcgccaacgtggtggaactggaattctacagcacatcta aagcaaacagttttggcttcaatcctctggtgctgggcagcaagaccaacgtgatctacgacctgtttgctagcaagcccttcacacacatcgatctg acccaggtgaccctgcaaaactccgataatagcgccattgacgccaacaaactcaagcaggccgtgggcgatatctacaactacaggcggttcga gagacagttccagggctacttcgccggcggatatatcgacaagtacctggtcaagaacgtcaacacaaacaaggacagcgatgacgacctggtct accggagcctgaaggaactgaacctgcatctggaggaagcctacagagaaggcgacaacacctactacagagtgaacgagaactactaccccg gcgccagcatctacgagaatgaaagagcctcaagagattccgagttccagaacgagatcctgaagcgggccgagcagaacggcgtgacattcg acgagaacatcaagcggatcaccgccagcggcaagtacagcgtgcagtttcagaagctggaaaacgacaccgactcaagcctggaaagaatga caaaggccgtggaaggcctggtgactgtgatcggcgaagagaagttcgagacagtggacatcacaggcgtgtctagcgacaccaacgaggtga aaagcctggccaaggaactgaagacaaacgccctgggcgtgaagctgaagctataa Artificial armY-HLA class II histocompatibility antigen, DRB1 beta chain fusion protein codon- optimized (for human) nucleotide sequence (2,262 bp) 50) SEQ ID NO: 50 atgtaccggatgcagctgctgagctgcatcgccctgtctcttgccctggtgaccaactctggagacaccagacctagattcctgtggca gcccaagagggaatgtcactttttcaacggtacagagcgggtgagattcctggaccggtacttctacaaccaggaggaaagcgtgcggtttgatag cgacgtgggcgagttccgggctgtgactgaactgggccggcccgatgccgagtactggaacagccagaaggatatcctggagcaggccagag ccgcagtggacacctactgcagacacaactacggcgttgtggaaagcttcaccgtgcaaagaagagtgcagcctaaagtgaccgtgtacccatct aaaacacagcctctgcagcaccacaatctgctggtatgcagcgtgtccggcttctaccctggcagcatcgaggtgcggtggttcctgaacggccag gaggaaaaagccggcatggtgtctaccggcctgatccagaatggcgactggaccttccagaccctggtgatgctggaaacagtgcctagatccgg cgaggtgtacacctgccaggtggagcaccccagcgtcaccagcccactgaccgtggaatggcgggccagatctgagagcgctcagagcaagg gcggcggcggaagcggcggcggaggaagcggcggcggcggcagcacaaatctggtcaaccagagcggctacgccctggtggccagtggca gaagcgggaacctgggctttaagctgtttagcacccagagccccagcgccgaagtgaagctgaaaagcctgtccctgaacgacggcagctacca gagcgagatcgacctgtccggcggagccaacttcagagagaagttcagaaactttgccaacgagctgagcgaggccattacaaatagccctaag ggcctggatagaccagtgcctaagaccgagattagcggcctgatcaagaccggcgataacttcatcacaccttcctttaaggccggttactatgacc acgtggccagcgacggctccctcctgagctactatcagtctaccgagtacttcaacaaccgggtgctgatgcctatcctgcaaacaacaaacggca ccctgatggccaacaacagaggctacgacgatgtgttcagacaagtgccctctttcagcggatggagcaacaccaaggctacaaccgtctccacta gcaacaacctcacctacgacaagtggacctattttgccgccaagggcagccctctgtacgacagctaccctaaccacttcttcgaggacgtgaaga ccctggccatcgacgctaaggacatcagcgcccttaagaccacaatcgattctgagaagcctacctacctgatcatccggggcttatctggcaacg gctctcagctgaatgagctgcagctgccggaaagcgtgaagaaggtgtccctctacggcgactacacaggcgtgaatgttgccaagcagatcttc gccaacgtggtggaactagaattctactccaccagcaaggctaacagctttggcttcaatcctctggtgctgggcagcaaaaccaatgtgatctatga tctgttcgcttctaagcccttcacccacatcgatctgacacaggtgaccctgcagaacagcgacaatagcgccatcgacgctaacaagctgaaaca ggctgtgggcgacatctacaactaccggagattcgagagacaattccagggctacttcgccggaggatatatcgacaagtacctggtgaaaaacgt gaacaccaacaaggattctgatgacgacctggtttacaggagcctgaaggaactgaaccttcatctggaagaagcctacagagagggcgacaata catactacagagtgaacgagaattactaccccggcgccagcatctacgagaacgaaagagcctctagagacagcgagttccaaaacgaaatcctc aagcgcgctgagcagaacggagtgacattcgacgagaacattaagcggatcaccgccagcggcaagtacagcgtccagttccagaaactggaa aacgacaccgattctagcctggaaaggatgaccaaggccgtggaaggcctggtaacagtgatcggagaggagaaattcgagacagttgacatca ccggggtgagcagcgatacaaatgaggtgaagtctctggccaaggaactgaaaaccaacgccctgggagtcaagctgaagctgtaa Artificial armY-HLA class II histocompatibility antigen, DR alpha chain fusion protein codon- optimized (for human) nucleotide sequence (2,241 bp) 51) SEQ ID NO: 51 atgtaccggatgcagctgctgtcatgcatcgccctgagcctcgctctggttaccaatagcatcaaggaagagcacgtgatcatccaggc cgagttctacctgaatcctgatcagagcggagagttcatgttcgacttcgacggcgatgagatctacatgtggacatggccaaaaaggaaaccgtgt ggcggctggaagagtaggccggttcgcctccttcgaggcccagggagctaggccaatatcgccgtggacaaggccaatctggagatcatgacc aagcggagcaactacacccctatcaccaacgtgccacctgaggtgacagtgctgaccaatagccccgtggagctgcgggaacctaacgttctgat ctgcttcatcgacaagtttacaccccccgtggtgaatgttacatggctgagaaacgggaagcctgtgaccacaggagtgtccgagacagtgttcctg cctagagaagaccacctgttccggaagttccactacctgcccttcctgccttccaccgaggacgtgtacgattgtagagtggaacactggggcctgg acgagcctctcctgaagcactgggagtttgacgcaccatcccctctgcctgagacaaccgaaggcggaggcggctccggcggcggaggtagcg gaggcggcggcagcaccaacctggtcaaccagtccggatacgccctggtggccagcggcagatctggcaatctcggcttcaagcttttcagcac gcagtcccctagcgccgaagtgaaactgaaatctctgtctctgaacgacggcagctaccagagcgagatcgacctgagcggcggcgccaatttca gagagaagtttcggaacttcgccaacgagctgtccgaggctattaccaacagtccaaagggactggatagacctgtgcccaagaccgagatcagc ggcctgatcaagacaggcgacaacttcatcacccctagcttcaaggccggctactacgaccacgtggcttctgatggctctctactgagctactacc agagcacagaatactttaacaatagagtgctgatgcctatcctgcagaccactaacggcaccctgatggccaacaacagaggctacgacgacgtgt tcagacaagtgccttcttttagcggatggtccaacacgaaggccaccacagtgtctacatctaacaacctgacatatgacaagtggacctacttcgcc gccaagggcagccctctgtacgacagctatcctaatcacttcttcgaggatgtgaaaacactggctatcgacgcgaaagacattagcgccctgaag accaccatcgatagcgaaaagcccacctacctgatcatcagaggcctctctggcaacggctctcagctgaacgagctgcaacttccggagagcgt gaagaaagtgtccctgtacggcgactacaccggcgtgaacgtcgctaaacagatctttgccaacgtcgtggaactggaattctatagcaccagcaa ggccaacagcttcggcttcaaccccctggtgctgggaagcaagaccaacgtgatctatgacctctttgcttctaaacctttcacccacatcgacctga cccaggtcacactgcagaacagcgacaacagcgccatcgacgccaacaagctgaagcaggctgtgggcgatatctacaactaccgtagattcga gcgccagttccagggctatttcgccggcggctacatcgacaagtacctggtgaagaacgtgaacacaaacaaggacagcgacgatgatctggtct acagaagcctgaaggagctgaacctgcacctggaagaagcctacagagagggcgataacacctactacagggttaatgagaattactaccccgg cgctagcatctacgagaacgagcgcgccagcagagattctgaattccaaaacgagatcctgaaaagagccgaacagaacggcgtgacattcgat gagaacatcaagcggatcacagccagcggcaagtacagtgtgcagtttcagaaactggaaaacgacaccgacagcagcctggagagaatgacc aaggccgtggaaggcctggtgaccgtgatcggcgaggaaaagttcgaaaccgttgacattaccggcgtgtctagcgataccaacgaggtgaaga gcctggccaaggagctgaagacaaacgccctgggggtgaagctgaagttataa Artificial armY-T cell receptor beta variable 7-9 fusion protein codon-optimized (for human) nucleotide sequence (1,950 bp) 52) SEQ ID NO: 52 atgtaccgcatgcagctgctgagctgcatcgccctgagcctcgccctggtgaccaacagcggcgttagccagaacccccggcacaa gattaccaagoggggccagaacgtgaccttcagatgtgaccccatcagcgaacacaaccggctgtactggtacagacagacactgggccaagga cctgagttcctgacctacttccagaacgaagcccagctggagaaatctagactgctttccgatagattcagcgccgagaggcctaagggctcttttag cacactggagatccagagaacagagcagggcgatagcgcaatgtacctgtgcgccagcagcctgggcggcggcggcagcggcggaggcgg ctccggcggcggcggatctaccaacctggtgaaccagtctggctacgccctggtggcctctggtagaagcggcaacctgggctttaagctgtttag cacacagagtccctctgccgaggtgaagctgaagagcctgtccctgaacgacggcagctatcagtccgagatcgatctgagtggcggagctaact tccgggaaaagttcagaaacttcgccaatgagctgtctgaagccatcaccaatagccctaagggcctggacagacctgtgcctaagaccgagattt ctggcctgatcaagacaggtgataatttcatcacccctagctttaaggctggctactacgaccacgtggccagcgatggaagcctgctgagctacta ccagtccaccgagtacttcaacaacagagtgctcatgcctatcctgcaaaccacaaacggaacactgatggccaacaacagaggatatgatgacgt gttcagacaggtgccatctttttccggctggagcaacaccaaggccaccaccgtgtctacaagcaacaacctgacatatgacaagtggacctacttc gccgccaagggctccccactgtacgacagctaccctaaccacttcttcgaggacgtaaagacactggctatcgatgccaaagacatcagcgcctta aagaccaccatcgacagcgagaagcccacctacctgatcatcagaggactgagtggcaacggcagccagctgaatgaactgcagctgcctgaat ctgtgaagaaggtgtccctgtacggcgactacaccggagtgaacgtggccaagcagatcttcgctaatgtggtcgagctggaattctacagcacca gcaaggccaatagcttcggcttcaaccctctggtcctcggctctaagaccaacgtcatctacgacctattcgctagcaagcctttcacccacatcgac ctgacccaggtgaccctgcagaacagtgacaatagcgccatcgacgccaacaagctgaagcaagccgtgggggacatctacaactaccggaga tttgagcggcagttccagggctatttcgctggcggatacatcgacaagtacctggtgaaaaacgtgaatacaaacaaggacagcgacgacgatctg gtgtaccgctctctgaaggaactgaacctgcatctggaagaggcctacagagagggcgataatacctactaccgggtgaacgagaactactaccc cggcgcctccatctacgagaacgaacgggccagccgggacagcgaattccaaaacgagatcctgaaaagagctgaacagaatggcgtgacctt cgacgagaacatcaagagaatcaccgcctccggcaagtacagcgtgcagttccagaagctggaaaatgacactgattctagcttggaaagaatga caaaagccgtggaaggcctggtcacagtgatcggcgaggaaaagttcgagacagtggacatcacaggcgtgagcagcgataccaacgaggtg aaaagcctggctaaagagctgaagaccaacgccctgggcgttaaactgaaactgtaa Artificial armY-T cell receptor beta variable 19 fusion protein codon-optimized (for human) nucleotide sequence (1,947 bp) 53) SEQ ID NO: 53 atgtatagaatgcagctgctgtcctgcatagccctgtctctggctctggtgaccaactctgggatcacccagtccccaaagtacttgtttag aaaggagggccagaacgtcaccctgtcttgtgaacagaacctcaaccacgacgccatgtactggtaccggcaggaccctggacagggcctgag actgatctactacagccaaatcgttaatgatttccaaaagggagatattgctgagggctacagcgtgtccagagaaaagaaagaaagcttccctctg accgtgaccagcgcccagaagaaccctaccgccttctacctgtgcgcctccagcattggcggcggcggcagcggaggcggaggcagcggagg cggcggctcaacaaacctggttaaccagtccggctacgccctggtcgcctccggaagaagcggcaacctcggcttcaagctgttcagcacccag agcccttccgccgaggtgaagctgaagagcctgagcctgaacgacggcagctaccagagcgagatcgacctgtctggcggagctaatttccgcg agaagttcagaaacttcgccaacgagctgagcgaggccatcacaaacagccctaagggcctggacagacctgtgcctaagacagagatcagcg gcctgatcaagaccggcgataatttcatcacaccatcttttaaggccggatattacgaccacgtggccagcgatggcagcctgctgagctactacca gtctaccgagtactttaacaacagggtccttatgccaatcctgcaaacaacaaacggcacactgatggccaacaatcggggctatgatgatgtgttca gacaggtgccctctttcagcggatggtccaacaccaaggccaccacagtgtctaccagcaacaacctgacctacgataagtggacttacttcgccg ccaagggctcacccctgtacgacagctaccctaaccatttcttcgaagatgtgaagacgctggccatcgacgcaaaggacatcagcgccctgaag accaccatcgacagcgaaaaacccacctacctgatcatccggggcctaagcgggaatggtagccagctgaacgagctgcagctgcctgagagc gtgaaaaaggtgagcctgtacggcgactacacaggcgtgaacgtggccaaacagatcttcgctaatgtggtggaactggaattctattctacatcca aggccaacagcttcggcttcaaccccctggtgctgggctctaaaacaaacgtgatctacgacctgttcgctagcaagcctttcacccacatcgacct gacccaagtgaccctgcagaatagcgataacagcgctatcgacgccaacaagctgaagcaggccgtgggagacatctacaattacagaagatttg aaagacagttccagggctacttcgccggcggctacatcgacaaatacctggtgaagaacgtgaataccaacaaggattctgacgacgacctggtct accggtctctgaaagagctgaacctgcacctggaagaggcctaccgggagggagataacacctattaccgggtgaacgagaattactaccccgg cgcctccatctatgagaacgagagagccagcagagacagcgagttccagaacgagatcctgaaaagagccgagcagaacggcgtgaccttcga cgagaacatcaagcggatcaccgccagtggcaagtacagcgtgcagtttcaaaagctagaaaacgacacagatagcagcctggaaagaatgac caaggctgtggaaggcctggtgaccgtgatcggcgaggaaaagtttgagacagtggacatcaccggcgtgagctctgacaccaatgaggtcaaa agcctggctaaggaactgaagaccaacgccctgggcgtgaagctgaaactctaa Artificial armY-Hepatitis A virus cellular receptor 1 fusion protein codon-optimized (for human) nucleotide sequence (2,700 bp) 54) SEQ ID NO: 54 atgtaccgcatgcagcttctgtcttgtatcgccctgagcctggcgctggtcaccaacagcagcgtgaaagttggcggagaggccggtc ctagcgtcaccctgccttgccactactctggcgctgtgaccagcatgtgctggaaccggggcagctgtagcctgttcacctgccagaatggcatcgt gtggacaaacggtacacacgtgacatacagaaaggacacaagatacaagctgctgggcgacctgtcaagacgggatgtgtctctgaccatcgag aacaccgctgtaccgacagcggcgtgtactgctgcagagtggagcacagaggctggttcaatgacatgaagatcaccgtgagcctggagatcgt gcctccaaaggtgaccaccacgcctatcgtgacaaccgtacctacagtgaccaccgtgcggaccagcacaaccgtgcctaccaccaccaccgtg cccatgaccacggtgcccaccacaaccgtgccaaccaccatgagcatccccaccacgacaacagtgctgacaaccatgaccgtactacaacaac atcagtgcctaccacaacaagcattcccacaaccacaagcgtgcctgtcacaacaaccgtgtccacattcgtgcctcctatgcccctgcctagacag aatcacgagcctgtggctacctctcctagctcccctcagcctgccgagacacaccctactaccctgcagggcgccatccggagagaacccaccag cagccctctgtatagttacaccaccgacggcaatgataccgtgaccgaaagcagcgatggactgtggaacaacaaccaaacacagctgttcctgg aacattccctgctgacagccaatacaaccaagggcatctacgccggagtgtgcatctccgtgctggtcctgctggcactgctgggagttatcatcgc caagaagtactttttcaagaaggaagtgcagcagctgagcgtgagcttctccagcctgcagatcaaagattgcagaacgccgtggaaaaggaagt gcaagccgaagataacatctacatcgagaactccctgtacgccaccgatggcggcggaggctccggcggcggaggaagcggcggcggcggc tccacaaatctggtgaaccagagcgggtacgccctggtggccagcggcagaagcggaaatctgggcttcaagctgtttagcacccagagcccttc tgccgaggtgaaactgaaaagcctgtccctcaacgacggcagctaccagagcgagattgacctgagcggcggagccaatttcagagagaagttc cgcaacttcgctaacgagctgtctgaagcaatcacaaactcccctaagggactggatagacccgtgcctaaaaccgagatcagcggcctgatcaa gactggagacaatttcatcacccctagctttaaggccggctactatgaccacgttgcctccgacggcagcctgctgagctactaccagtctacagag tactttaacaacagagtgctgatgcctattctgcagacaactaacggcacactgatggccaacaatcggggctacgatgacgtgttcagacaagtgc ccagctttagcggctggagcaacaccaaggctactaccgtgtctaccagcaacaacctgacctacgacaagtggacctacttcgccgctaagggct ccccactgtatgacagttaccccaaccacttcttcgaggacgtaaagaccctggccattgacgccaaggatatcagcgccctgaaaaccaccatcg acagtgagaagcccacctacctgatcatccggggcctgagcggcaacggctctcagcttaacgagctgcagctgcctgagagcgtgaaaaaggt gagtctatacggcgactacaccggcgtgaacgtggccaaacagatcttcgccaacgtggtggagctggaattctacagcaccagcaaggccaact ctttcggcttcaaccccctcgtgctgggctccaagacaaacgtgatctacgacctgtttgcttctaaacctttcacccacatcgacctcacccaggtga ccctgcaaaatagcgataacagcgccatcgacgccaacaagctgaagcaggctgttggagatatctataactaccggagattcgaaagacagttcc aaggctatttcgccggcggctacatcgacaaatacctggtgaaaaacgtgaataccaacaaggacagcgacgatgacctggtgtacagatctctga aggagctgaacctgcacctggaagaggcctacagagaaggcgacaacacatactacagagtgaacgagaactactacccaggagcttctatcta cgagaatgaaagagccagcagagactctgagttccagaacgagatcctgaagcgggccgagcagaacggcgtgaccttcgacgagaatatcaa gagaatcaccgcctccggcaagtacagcgtgcagtttcagaagctggaaaacgatacagactccagcctggaacggatgacaaaggccgtgga gggcctggtgaccgtgatcggcgaggaaaaattcgaaaccgtggacatcaccggcgtctccagcgataccaacgaggtgaagagcctggccaa ggaactgaagaccaacgccctgggagtgaagctgaagctataa Artificial armY-Myelin and lymphocyte protein fusion protein codon-optimized (for human) nucleotide sequence (2,127 bp) 55) SEQ ID NO: 55 atgtacagaatgcagctgctgagctgcatcgccctgtccctggccctggtgaccaatagcatggcccctgccgccgctaccggcggta gcacactgcctagcggcttcagcgtgtttacaacactgcctgacctgctctttatcttcgagttcatcttcggcggcctggtgtggatcctggtggcctc tagcctggtcccttggcccctggtgcagggctgggtcatgttcgtgtccgtgttctgcttcgtggcaacaaccacactgatcatcctgtacattatcgg cgcccacggtggcgagacaagctgggtgacactggacgccgcttatcattgtaccgccgctctgttttacctgtcagcaagcgtgctggaagccctt gccaccatcaccatgcaggatggctttacctacaggcactaccacgagaacatcgccgccgtggtgttctcctacatcgccacactgctgtatgtcgt gcacgccgtgttcagcctgattagatggaagtccagcggcggcggcggatctggcggaggcggaagcggcggcggaggctctaccaacctgg tgaaccagagcggatacgccctggtggcctctggcagaagcggaaacctgggcttcaaactgttcagcacccagtccccaagcgccgaggtgaa actgaagagcctgagcctgaatgacggcagctaccagagcgagattgacctctctggtggagccaatttcagagagaagttccggaacttcgcca acgaactgtctgaagccatcaccaacagcccaaaaggcctcgatagaccagtgcccaagaccgaaatcagcggactgatcaagaccggcgata atttcattacccctagctttaaggctggctattacgaccacgtggcttctgacggcagcctgctgagctactaccagagcaccgagtactttaacaata gagtgctgatgcctatcctgcagaccaccaacggcaccctgatggccaacaacagaggttacgacgacgtgttcagacaggtgcctagcttcagc ggctggtccaacaccaaggcgactaccgtctccacaagcaacaacctgacctacgataagtggacctacttcgccgcaaagggctctcctctgtac gacagctaccccaaccacttcttcgaagatgtgaagaccctggctatcgatgctaaagatatcagtgccctgaagacaacaatcgacagcgagaaa cctacctacctgatcatcagaggcctgagcggaaatggctcgcagctgaacgagctgcagctgcctgagtccgtgaaaaaggtgtccctctacgg cgactataccggcgtgaacgttgccaagcagatctttgctaatgtggttgagctggagttctacagcacctctaaggccaattcttttggcttcaacccc ctggtgctgggcagcaagaccaacgtgatctacgacctgttcgccagcaagcccttcacccacatcgatctcacccaagtgacactgcaaaactcc gacaacagcgccatcgacgccaacaagctgaagcaggccgtgggcgatatctacaactacagacggttcgagagacagttccagggatatttcg ccggcggctacatcgacaagtacctggtcaagaacgtgaacacgaacaaggatagcgatgacgacctggtgtaccggagcctgaaggaactga acctgcacctggaagaggcttaccgggaaggcgacaacacctactaccgcgtgaatgaaaactactaccctggcgccagcatctacgagaacga gcgggcctcccgggacagcgaattccagaatgaaatcctgaaaagagccgagcagaacggggtgaccttcgacgagaacatcaagcggatcac cgccagcggcaagtactccgtgcagttccaaaagctggaaaacgataccgacagcagcctggaaagaatgactaaggccgtcgagggcctggtt acagtgatcggcgaggaaaaatttgagacagtggacatcacaggcgtcagcagcgacacaaacgaggtgaagtctctggccaaggagctgaag accaacgcccttggagttaagctgaagttataa Artificial armY-Complement factor H fusion protein codon-optimized (for human) nucleotide sequence (5,307 bp) 56) SEQ ID NO: 56 atgtacagaatgcagctgctgtcctgcatcgccctgtctctggccctggttaccaattcagaagattgcaacgagctgcctcctcggcgg aacaccgaaatcctgaccggatcctggagcgaccagacataccccgagggcacccaggccatttacaagtgtcggcctggctacaggtcactgg ggaacgttatcatggtgtgccggaaaggcgagtgggtggccctgaaccctctgcggaagtgccagaaacggccatgtggccaccctggcgaca cccctttcggaaccttcaccctcacaggtggcaacgtctttgagtacggcgtgaaagccgtttacacatgcaatgagggataccagctgctcggaga gatcaactacagagagtgtgataccgacggatggaccaacgacatccccatctgtgaagtggtgaagtgcctccctgtcacagcccctgaaaacg gcaagatcgtgtcttctgctatggagcctgatagagaatatcactttggccaggccgtgagattcgtgtgcaactctgggtacaaaatcgagggagat gaggaaatgcactgctctgatgacggcttctggagcaaggaaaagcctaagtgcgtggagatcagctgcaagagtcctgacgtgatcaacggctc ccctatctcacagaagatcatttacaaggagaacgaaagattccagtacaaatgtaacatgggatacgagtactctgaaagaggtgatgccgtttgta ctgaatccggctggcggcctctgcctagctgcgaggagaagagctgtgacaatccttacatccccaatggagattacagccctctcagaatcaagc accgcaccggcgacgagatcacctaccagtgtcgcaacggattttaccccgctacccggggcaacaccgccaagtgtacctccacaggctggat ccctgcccccagatgcaccctgaaaccctgcgactaccctgatatcaagcacggcggcctgtatcacgagaacatgagaagaccttacttccctgt ggccgtgggcaagtactactcttattactgcgatgaacactttgaaacccctagcggcagctactgggatcacatccactgtacccaggatggctgg tctccagctgtgccatgtctgcgcaagtgctacttcccctacctggaaaacggctacaaccagaactacggtagaaagttcgtgcagggcaagtcta tcgacgtggcatgccaccccggctacgccctacctaaggctcagaccacagtgacctgtatggaaaacggttggtctcccaccccacgctgcatcc gggtgaagacctgctccaagtcttctatcgatattgaaaacggcttcatctctgaatcccaatacacctatgctctgaaggaaaaggccaagtaccag tgtaagctgggatacgtgaccgccgacggcgagacatctggctccatcacctgtggcaaggacggctggagcgcacagcccacatgcattaagt cttgcgacatcccggtgttcatgaacgccagaaccaagaacgatttcacctggttcaagctgaacgacacactggattacgagtgtcacgacggata tgaaagcaataccggcagcaccaccggcagcatagtgtgcggctacaacggctggagcgatctgcccatctgctacgaaagagaatgcgagctg cctaagatcgatgtgcacctggtgcccgatcggaagaaggaccagtacaaggtgggcgaagtgctgaagtttagctgcaagcccggattcacaat cgtgggaccaaattctgtgcagtgctaccacttcggcctgagccccgacctgcccatctgcaaggaacaagtgcagagctgtggacctcctcctga gctgctgaacggaaacgtgaaagagaagacaaaggaggagtacggccattctgaggtggtcgagtactactgtaaccctagattcctgatgaagg gccctaacaagatccaatgcgtggacggagagtggaccaccctgcccgtttgcatagtggaggaaagcacctgtggcgacatcccggaactgga acacggctgggcccagctgagcagccctccctactactacggcgattctgtcgaatttaactgtagcgagtcattcaccatgatcggccatagaagc attacttgcatccacggagtgtggactcagttacctcagtgcgttgccatcgacaagctgaagaagtgtaaatctagcaacctgatcattctggaaga acacctgaagaacaagaaagaattcgaccacaattcaaacatcagatacagatgccggggcaaagagggctggatccacaccgtgtgcatcaac ggcagatgggaccccgaggtgaactgcagcatggcccagatccagctgtgtcctcctcccccccagatcccaaacagccacaacatgaccacca cgctgaactaccgagacggcgagaaggtgagcgtgctgtgccaggagaactacctgatccaggagggcgaagagatcacatgtaaggacggtc gttggcagagcatccccctgtgcgttgaaaagatcccctgcagccagcctcctcaaatcgagcacggcaccatcaacagctccagatcctcccag gagtcctacgcccacggcacaaaactgagctacacatgcgaaggcggattccggatttctgaagagaacgagaccacctgctacatgggcaagt ggagctctccccctcaatgtgagggcctgccttgcaagagccctcctgagatcagccacggcgtggttgcccacatgtctgatagctaccaatacg gcgaggaagtgacttataagtgcttcgaggggtttgggatcgatggtcccgccattgccaagtgcctgggagaaaaatggtctcatccaccatcatg tatcaagaccgactgcctgagtttgcctagctttgagaatgctatccctatgggcgagaagaaggacgtatacaaagccggcgagcaggtgacata cacatgtgccacctactacaaaatggacggcgccagcaatgtaacgtgtataaatagcagatggacaggcagacctacctgcagagatacaagct gcgtgaatcctcccacagtccaaaatgcttatatcgtgagtcggcagatgagcaagtaccctagcggcgagagagtgagataccagtgcaggtcc ccctacgagatgttcggcgacgaggaggtgatgtgcctaaacggcaactggacggaacctcctcagtgcaaagacagcaccggaaaatgcggc cctcctcctcctattgacaacggcgatatcaccagattccactgagcgtgtacgctcctgcttcatctgtcgagtaccaatgccagaatctgtaccag ctggaaggtaataagagaatcacctgcagaaacggacagtggagcgaacctcctaagtgcctgcacccttgcgtgatctccagagagatcatgga aaactacaacatcgccctgagatggaccgccaaacagaagctgtacagccggaccggcgagagcgtcgagttcgtgtgtaagagaggttaccga ctgtcctctagaagccataccctgcggaccacctgctgggacggcaaactagagtaccctacgtgcgccaagcggggcggaggtggctcagga ggcggcggctctggcggcggcggctctacaaacctggtgaaccagagcggttatgccctggtggccagcggcaggtctggaaatctgggcttta agctgttttcaacgcagagcccttccgccgaagttaagctgaaatcactgagcctgaatgacggctcctaccagagcgagatcgacctgtctggag gagctaactttagagagaagttcaggaacttcgctaacgagctgagcgaagccatcaccaatagccctaaaggcttggacagacctgtgcccaag actgagatcagcggcttgatcaagaccggcgacaacttcatcaccccatcttttaaggccggctactacgaccacgtggcctctgacggaagcctg ctatcctactatcagtctactgagtacttcaacaacagagtgctgatgcctatcttgcagaccaccaatggcaccctgatggccaacaaccggggata tgacgatgtgttcagacaggtgcctagcttcagcggatggagcaacaccaaggcgacaaccgtgagcacatccaacaacctgacatacgacaagt ggacatattttgcggccaagggctctccactgtatgatagctaccccaatcacttcttcgaggacgtgaagaccctggccatcgacgccaaagacat cagcgcccttaagacaacgatcgattccgagaagcctacctacctgatcattagaggcctgagcggcaacggcagccagctgaacgagctgcag ctgccagagtccgtgaagaaagtgtccctgtatggcgactacacaggcgtcaacgtggccaagcaaatcttcgctaatgtggtggaacttgagttct acagcacatcgaaggctaactctacggcttcaaccccctggtgctgggcagcaagaccaatgtgatttacgacctgttcgccagcaagcccttcac acacatcgacctgacccaagtgacactgcaaaacagcgataacagcgccatcgacgccaacaagctgaagcaggctgtgggcgacatctacaa ctaccggagattcgagagacagttccagggctacttcgccggcggctacatcgataagtacctggtgaagaacgtgaataccaacaaagactctga tgacgacctggtgtacagaagcctgaaagagctgaacctgcatctggaagaagcctaccgggagggcgataacacctactaccgggtgaacgaa aactactatcctggcgctagcatctacgagaacgaacgagccagcagggattctgaattccagaacgagatcctgaagcgggccgagcagaacg gagtgacatttgatgagaacatcaaacggatcaccgccagcggcaaatactccgttcagttccaaaaactggaaaatgatacagacagcagcctgg agagaatgaccaaggccgtggaaggcctggtgacggtgatcggcgaagagaaattcgagaccgtggacatcaccggcgtaagctctgacacca acgaagtgaagagcctggctaaggaactgaagaccaacgccctgggggtcaagctgaagctgtaa Artificial armY-Hepatocyte growth factor receptor fusion protein codon-optimized (for human) nucleotide sequence (4,392 bp) 57) SEQ ID NO: 57 atgtaccggatgcaactgctgagctgcatagccttatctctggcactggtgaccaacagcgagtgcaaggaagccctcgccaagagtg aaatgaacgtgaatatgaaataccagctgcctaacttcaccgccgaaacccctatccagaacgtcatcctgcatgagcaccacatcttcctgggcgc tacaaattacatctacgtgctgaatgaggaggacttgcagaaagtcgccgaatacaagaccggacccgtgctggagcacccggactgcttcccatg tcaggattgcagttctaaggccaacctgagtggtggcgtttggaaggacaacatcaacatggccctggtggtcgacacatattacgacgatcagctg attagctgtggcagcgtgaaccggggcacctgccagagacacgtgttccctcacaaccacactgccgacatccagagcgaagtgcactgcatctt cagcccccagatcgaggagcctagccagtgtcctgactgcgtggtgtcagccctgggtgctaaggtactgtccagcgttaaggacagattcatcaa ctttttcgtgggtaacacaatcaacagcagctacttccccgatcaccctctgcacagcatatccgtgcggagactcaaggaaacaaaggacggcttc atgttcctgacagaccagagctatatcgatgtgctgcctgagttcagagattcttaccccatcaagtacgtgcacgccttcgagagcaacaattttatct atttcctgacagtccaaagggagacactcgatgcccagaccttccacaccagaatcatccggttctgcagcattaacagtggactgcactcttatatg gaaatgcccctggaatgtatcctcacagagaaaaggaagaaaagaagcactaagaaggaggtgttcaacattctgcaggctgcttacgtgtccaag cctggcgctcagctggccagacagatcggcgccagcctgaacgatgacatcctgttcggcgtcttcgcccaatctaagcctgacagcgccgagcc catggacagatctgctatgtgcgctttccccatcaagtacgtgaatgacttcttcaacaagatcgtgaacaagaacaacgtgcggtgcctgcaacact tctacggccctaaccacgagcactgttttaatagaaccctactgcggaactcctctggttgtgaagctagaagagacgaataccggaccgagttcac caccgccctgcagagggtggacctgttcatgggccaattcagcgaggtcctgctgacatctataagcaccttcatcaagggagatctgacaatcgc caacctgggcaccagtgagggcagattcatgcaggtggtggtgagtagatccggccctagtacaccccatgttaacttcctgctggactcacaccc cgtgtcccctgaggtgatcgtggaacatacactgaaccagaatggctatacactggtgatcaccggaaagaagattaccaagattcctctgaacgg cctgggctgcagacacttccagagctgtagccagtgcctgagcgcccctccttttgtgcagtgcggctggtgccacgacaagtgcgtgcgcagcg aggagtgcctgagcggcacctggacacagcagatctgtctgcctgccatctacaaggtctttccaaacagcgccccattggaaggcggaactcgg ctgacaatctgcggctgggacttcggctttcggcggaacaacaagtttgacctgaagaagacccgggtgctgctgggcaacgagagctgtaccct gaccctgagcgaaagcaccatgaacacgctgaaatgcaccgtgggcccagccatgaacaaacacttcaacatgtctatcatcatcagcaatggcc acggcacaacccagtacagcacgttcagctacgtggaccctgtgatcaccagcatctcaccgaagtacggccctatggccggcggcacattgctg accctgaccggaaattatctgaactcgggcaacagccgtcacatctccataggcggaaagacatgcacgctgaagtcggtgtctaacagcatcctg gagtgctacacaccagcccagaccatctcgacagaattcgctgtaaagctgaagatcgatctcgctaatcgagagacaagcatcttttcttacagag aggatcctatcgtgtacgagatccaccctacaaagtctttcatcagcggcggcagcaccatcacaggcgtgggaaaaaacctgaactctgtgtctgt gccgagaatggtgatcaacgtgcacgaggctggcagaaacttcacagtggcctgccagcatagaagcaacagcgaaatcatctgctgcaccacc ccctcgctgcagcagcttaatctgcagctgcccctgaaaacgaaggccttcttcatgctggatgggatcctgtctaagtacttcgatctcatctacgtg cacaatcctgtgtttaagccattcgagaagcccgtcatgatctctatgggcaacgagaacgtgctcgagatcaagggcaatgatatcgaccctgagg ccgtgaaaggcgaggtgctgaaagtgggcaacaaaagctgcgaaaacatccacctgcacagcgaagccgtgctgtgcaccgtgcctaacgactt gctgaagctgaactccgagctgaatatcgagtggaagcaggccatcagctctaccgtcctgggcaaggtgattgtgcaacctgaccagaacttcac cggcggtggcggtagtggaggcggcgggagcggaggcggaggaagcaccaacctggtgaaccagagcgggtacgccctggtagctagcgg cagaagcggcaacctgggctttaagctgttttctacccagagccctagcgccgaagtgaagctgaagagcctgagcctgaacgacggcagttacc aatccgagatcgacctgtctggcggcgccaacttcagagagaagttcagaaacttcgctaatgagctgtctgaggccatcaccaacagccctaagg gcctggatagacctgtgccaaagaccgagatctccggcctgatcaaaaccggcgataactttatcacacctagctttaaggccggctactacgacc acgtggcctccgacggctccctgctgtcctactaccagagcacagaatacttcaacaacagagtgctgatgcctatcctgcaaaccacaaacggca ccctgatggccaacaacagaggctacgacgatgtgttccggcaggtgcctagcttctccggctggagcaacaccaaggccactaccgtttctacca gtaacaacctgacctacgataagtggacctactttgccgccaagggcagccccctgtacgactcataccccaatcacttctttgaagatgtgaagac cctggccatcgatgccaaagatatcagcgctctgaaaacaaccatcgactccgagaagcccacctaccttattatcagaggcctgtccggcaacgg ctctcagctgaatgagctgcagctcccagaaagcgtgaagaaggtgtcgctgtacggcgactacaccggcgtcaatgtggccaaacagatatttgc caacgtagtagaattggaattctactctacaagcaaagccaactcttttggatttaaccccttagtgctaggatctaagacaaacgtgatctacgacctg ttcgccagcaaacctttcacccacatcgacctgacccaagtgaccctgcagaacagcgacaacagcgctatcgacgccaacaagctgaagcagg ccgtcggcgatatatacaattaccggcggttcgagagacagttccagggctacttcgccggaggatacatcgacaagtacctggtgaagaacgtga acactaataaggacagcgacgacgacctcgtgtacagaagcctgaaagaactgaatctgcacctggaagaagcctaccgggaaggagacaaca cctactacagagtgaacgaaaactactaccctggcgccagcatctatgagaacgagagagccagcagagattctgaattccagaacgagattctga aacgggccgagcagaatggcgtgaccttcgacgagaatattaagcgcatcaccgccagcggcaaatattccgtccagtttcagaagctcgagaac gacaccgacagcagcctggaaagaatgaccaaggccgtggaaggcctggtgaccgtgatcggcgaggaaaaattcgagaccgtggatatcacc ggcgtgagcagcgacacaaacgaagtgaagagcctggccaaggaactgaagaccaacgccctgggagtgaagctcaagctgtaa Artificial armY-Membrane cofactor protein (CD46) fusion protein codon-optimized (for human) nucleotide sequence (2,595 bp) 58) SEQ ID NO: 58 atgtaccgcatgcagctgctgagctgcatcgccctgtctctggctctggtgaccaacagctgcgaggaacctccaaccttcgaggccat ggaactgatcggcaagccaaagccctactatgagattggcgaaagagtggattacaaatgcaagaaaggctacttttacatcccccccctggccac ccacaccatctgtgatagaaaccacacatggctgcctgtctccgacgacgcctgttaccgggagacatgcccttacatccgagaccctctcaatgga caggccgtgcctgctaatggcacatatgagttcggataccaaatgcacttcatctgcaacgagggctactacctgatcggcgaagaaatcctgtact gcgagctgaaaggctcggtggctatttggtccggcaaacctcctatctgtgaaaaggtgctgtgcacccctcctcctaagatcaaaaacggcaagc acacctttagcgaggtggaagtgttcgagtacctggatgccgtgacatatagctgtgaccccgcccctggccctgatcccttcagcctgattggcga gagcaccatctattgcggcgataactctgtgtggagccgggccgcccctgaatgcaaggtggtgaagtgcagattccctgtggtggaaaacggaa agcagatctccggctttggcaaaaagttctactataaggctaccgtgatgttcgagtgcgacaagggattctacctggacggctctgatacaatcgtg tgcgacagcaactctacgtgggaccctccagtgcctaagtgtctgaaagttctgcctcctagctctacaaagccccccgccctgagccacagcgtgt ccaccagcagcacaaccaagtccccagccagcagcgccagcggacctagacccacctacaagcctcctgtgtccaactaccctggctaccccaa gcctgaggaaggcatcctggatagcctggatggcggcggcggctccggcggtggaggatctggcggcggaggaagcacattatctggtgaatc agagcggctacgccctggttgccagcggcagaagcggcaacctgggcttcaagctgtttagcacacagagccccagcgccgaggtgaagctga agagcttgtcgctaaatgatggctcctaccagtctgagatcgatctgagcgggggcgccaattttagagagaagttccggaacttcgcaaacgagct gtctgaagccatcaccaacagccctaaggggctggacagacctgtgccaaagaccgagattagcggcctcatcaagacaggcgacaatttcatca cacctagcttcaaggccggatactatgaccacgtggcctccgacggcagcctgctgagctactaccagagcacagagtacttcaacaacagagtg ctgatgcctatcctgcagaccaccaacggcaccctcatggccaacaatcggggctatgacgacgtgttcaggcaggtgcctagcttcagcggctg gagcaacaccaaggccaccactgtgtctacctccaacaacctgacctacgacaagtggacctacttcgcagctaaaggctctccactgtacgatag ctacccaaaccacttcttcgaggacgtgaagaccctggctattgacgccaaggacatctctgccctgaagaccacaatcgacagcgagaagccta cctacctgatcatccggggcctgagcggaaacggcagccagctgaacgagctgcagctgcccgagtccgtgaaaaaagtgtccctgtacggcga ctacaccggcgtgaacgtggccaagcagatcttcgctaatgtggtggaacttgagttctactctaccagtaaggccaactcctttggatttaaccccct ggtgctgggcagcaagaccaacgtgatctacgacctgttcgcctctaaacctttcacccatatcgacctgacccaggttacactgcaaaacagcgat aactctgccatcgatgccaacaagctgaagcaagccgtgggcgacatctacaactaccgcagatttgaacggcagttccagggctacttcgccgg cggctacatcgacaagtacttggtcaagaacgtgaataccaacaaggatagcgacgatgacctggtctaccggagcctgaaggaactgaacctgc acctggaagaagcctacagagaaggtgacaatacctactatagagtgaacgagaactactacccgggagccagtatctacgagaacgaaagagc ctctagagatagcgagttccaaaacgagatcctgaaaagagctgaacagaacggcgtgaccttcgacgagaacatcaagagaatcaccgccagc ggcaagtacagcgtgcagtttcagaagctggaaaacgacaccgacagctccctggaacggatgaccaaggctgttgagggcctggtcacagtga tcggagaggaaaagttcgaaacagtggatatcacgggcgttagcagcgacaccaacgaggtcaagagcctggccaaagagctgaagacaaac gccctgggcgtgaagctgaagctgtaa Artificial armY-Glycophorin-A fusion protein codon-optimized (for human) nucleotide sequence (1,884 bp) 59) SEQ ID NO: 59 atgtaccgtatgcagctgctgtcttgcatcgccctcagcctggctctggtgaccaacagctctagcacaacaggcgttgccatgcacac cagcaccagctctagcgtgaccaagagttacatctcttctcagaccaacgatacccacaagagagacacgtacgccgccaccccaagagcccatg aggtgtctgaaatcagcgtgcggaccgtgtacccccccgaggaagaaaccggcgagcgggtgcagctggcccaccacttttctgagcctgaggg aggtggaggcagcggcggcggcggcagcggcggaggcggcagcaccaacctggttaaccagtccggctatgccctggtggctagcggcaga tccggcaacctgggctttaagctgttcagcacccagagccccagcgccgaggtgaaactgaagagtctgagcctgaatgacggctcttatcagag cgagatcgacctgagcggcggcgccaatttcagagagaagtttcggaacttcgccaatgaactgtccgaagccatcaccaacagcccaaagggc ctggacagacccgtgcctaaaacagaaatcagcggactgatcaagaccggcgataatttcatcacacctagcttcaaggccggctactacgacca cgtggccagcgacggctccctcctgagctactaccaaagcacagagtacttcaacaaccgggtgctgatgcctatcctgcagaccacaaatggca ccctcatggccaataacagaggctatgatgacgtgttccggcaggtgcccagctttagcggatggagcaacaccaaggccacaaccgtgtccaca tccaacaacctgacctacgacaagtggacctacttcgctgctaagggcagccctctgtacgactcttaccctaaccacttcttcgaggatgtgaagac gctggctatcgacgccaaggacatctcggccctgaagaccacaatcgacagcgagaagcctacatacctgatcatcagaggactgagcggcaac ggcagccaactgaatgagctgcagctgcctgagagcgtgaaaaaggtgagcctgtacggcgactataccggcgtgaatgtggctaagcagatctt cgccaacgtcgtggaactggaattctacagcaccagcaaggctaactccttcggctttaaccccctggtgctgggctccaaaacaaacgtgatctac gacctgttcgcctccaaacctttcacccacatcgacctgacacaagtgacactgcaaaatagcgataacagcgccatcgacgccaacaagcttaag caggccgtgggcgacatctacaactacagaagattcgagagacagtttcagggctatttcgccggaggctatattgataaatacctggtgaagaac gtgaacaccaacaaagatagcgacgacgatctggtgtacagatctctgaaagagctgaacctgcacctggaagaggcctaccgggaaggagata acacctactacagggtcaacgagaactactaccctggagccagcatctacgagaacgagagagcttctagagatagcgagttccagaatgaaatc ctgaagcgggccgaacagaacggagtgacattcgacgagaacattaagcggatcaccgcctctgggaagtacagcgtgcagttccagaagctg gagaacgacaccgattcttctctggaaagaatgaccaaggcagtcgagggcctggtgaccgtgatcggagaggaaaagttcgagacagtcgaca tcactggcgtgagctcggacaccaacgaggtaaagagcctggccaaggaactgaagaccaacgccctgggcgtgaagctcaaactgtaa Artificial armY-C-type lectin domain family 4 member K (Langerin, CD207) fusion protein codon- optimized (for human) nucleotide sequence (2,460 bp) 60) SEQ ID NO: 60 atgtatcggatgcagctgctgagctgcatcgccttatccctggctctggtgacaaactcccctagattcatgggcaccatcagcgacgtg aaaacgaacgtgcagctgctgaagggaagagtggacaacatctctaccctggattctgagatcaaaaagaactccgatggcatggaagctgctgg cgtgcaaatccagatggtgaatgagagcctgggctacgtgcggtcccagttcctgaagctgaagaccagcgtggaaaaggccaacgcccagatt cagatcctgacaagaagctgggaggaagtgtctacactgaatgctcagatccccgagctgaaaagcgatctcgagaaggctagcgccctgaaca ccaagatccgggccttgcaaggctctctggaaaacatgagcaagctgctgaagagacagaacgatatcctgcaggtcgtgtctcagggctggaag tacttcaagggcaacttctactacctctgatccctaagacctggtactctgccgagcagttctgcgtgtccagaaacagccacctgaccagcgtta ccagtgagagcgagcaggagttcctgtataagacagccggaggcctgatctattggatcggcctgaccaaggccggcatggagggcgattggag ctgggtcgacgacacccctacaacaaagtgcagagcgtgcggttaggatccccggcgagcctaacaacgccggcaacaacgagcactgcggc aatatcaaagcccctagcctgcaggcctggaacgatgccccgtgcgacaagacatactgttcatctgtaaaaggccttacgtgcccagcgaaccc ggcggcggcggcagcggaggcggcggctctggcggaggaggaagcaccaacctggtgaaccagagcggctacgccctggtcgccagcggc agaagcggaaatctgggcttcaagctgtttagcacacagagcccatctgcagaggtgaaactgaagagcctgagcctgaacgacggcagctacc agtctgagatcgacctgtctggcggggccaatttccgggaaaagttccggaacttcgctaacgagctgtctgaagccatcaccaatagtccaaagg gcctggaccggcctgtgcctaagactgagatactggccttatcaagacaggcgacaacttcatcacccctagattaaggccggctactacgacca cgtggccagcgatgggtctctgctgagctactaccagagcacagagtacttcaacaatagagtgctgatgccaatcctgcaaacaacaaatggcac actgatggccaacaaccggggctacgacgatgtgttcagacaggttcctagcttcagcggctggtccaacaccaaggccaccaccgtgagcaca agcaacaacctgacatatgataagtggacctacttcgccgctaagggcagccctctgtacgacagctaccctaaccatacttcgaggacgtgaaga cgctggccattgacgccaaagacatctcggccctgaagaccaccatcgacagcgaaaaacctacctacctgatcatcagaggcctgagcggcaa cggatctcagctgaacgagctgcagctgcccgagagcgtgaagaaggtgagcctctacggcgactacaccggcgtgaacgtggccaagcagat tttcgcaaacgtggtggaactggaattttacagcacctccaaggctaacagcttcggctttaaccccctggtgctgggatctaagaccaatgtgatcta cgacctcttcgcttccaagccctttacccacatcgacctgacccaggtgaccctgcaaaattcagataatagcgccatcgacgccaacaagctgaaa caagccgtgggcgacatctacaactacagaagattcgagcgccagttccagggctattttgctggcggttacatcgacaagtacctggtgaaaaac gtgaacaccaacaaggacagcgacgatgacctggtgtacagatccctgaaagagctgaacctgcacctggaagaggcctacagagagggcgat aatacctactatagagtgaatgagaactactaccctggcgccagtatctacgagaacgaaagagctagcagagacagcgagttccagaacgagat cctgaagcgggccgagcagaatggcgtgaccttcgacgagaacatcaagcggatcacagccagcggcaagtacagcgtgcagttccagaaact ggaaaacgacacagatagcagcctcgagagaatgaccaaggccgtggaaggactggtgaccgtcatcggcgaagaaaagttcgaaacggtgg acatcaccggagtgtcctccgacaccaatgaggtgaagtccctggccaaggaactgaagaccaatgccctcggagtgaagctgaagctataa Artificial armY-Anthrax toxin receptor 1 fusion protein codon-optimized (for human) nucleotide sequence (3,264 bp) 61) SEQ ID NO: 61 atgtacagaatgcagctgttgagctgtatcgccctgagcctggccctggtgaccaacagcgaggacggtggccctgcctgctacggc gggtagacctgtacttcatcctggataagtccggttctgtgctgcaccactggaacgaaatctactacttcgtggaacagctggcccacaagtttatct cccctcagctgcggatgagcttcatcgtgttctccacaagaggcaccaccctgatgaagctgaccgaggatcgcgagcagatcagacagggactg gaagagctgcagaaagtgctgcctggcggcgatacatacatgcacgagggatttgagagagcctccgagcagatctattacgagaacagacagg gctaccgcaccgccagcgtgatcattgccctgacagacggcgagctgcatgaagatctgttcttctacagcgagcgcgaggccaacagaagccg ggacctgggcgccatcgtgtactgtgtgggcgtgaaggacttcaacgaaacccagctggccagaatcgccgatagcaaggatcacgtgttccctg tgaacgacggattccaggccctgcagggcatcatccacagcattctaaagaagtcctgcatcgagatcctggctgctgaacccagcaccatctgcg ccggcgagagcttccaggtggtggtgcggggcaacggcttccggcacgccagaaacgtggacagagttctgtgcagctttaagatcaatgatagc gtgacacttaacgagaagcccttcagcgtggaagatacctacctgctgtgtcctgctccaatcttaaaagaggtgggaatgaaagccgccctgcaa gtgtccatgaacgatggcctctcttttatcagttccagcgtgatcatcaccacaacccactgttctgatggtagcatcctggccatcgccctgctcatcc tgtttctgctgctggccttggccctgctgtggtggttctggcctctgtgctgcaccgtgatcatcaaagaagtgcctcctcctcccgctgaagagagcg aagaggaggacgacgacggcctgcctaagaaaaagtggcccacagtcgatgcttcttactacggcggcagaggcgttggcgggatcaagcgga tggaagtgcggtggggagaaaagggcagtaccgaggaaggagctaagctggaaaaggccaagaatgccagagtgaagatgcctgagcagga gtacgagttccccgagcctcggaacctgaacaacaacatgagacggccctcctctccaagaaagtggtacagccctatcaagggcaagctggac gccctctgggtcctgctgagaaagggctacgacagagtgagcgtgatgcggccccagcctggcgacactggcagatgcatcaactttaccaggg tgaagaacaaccagcctgccaagtaccccctgaacaacgcctaccacacaagctctcctcctcccgctcccatctacactccgccccccccagcc ccacactgccctcccccaccaccctctgcccctacccctcccatccccagccccccttcaaccctgcctccccctccgcaagcccctccaccaaac agagcacctccacctagcagaccccctcctagaccttctgtgggcggcggcggcagcggcggaggcggcagcggcggaggcgggagcacca acctggtgaaccagagcggctacgccctggtggcctccggcagaagcggcaacctgggcttcaagctgttctcgacccagagcccttctgccga ggtgaagctgaaaagcctgtcactgaatgacggctcttaccagagcgagatcgacctgagcggcggagctaacttcagagaaaagttccggaact tcgccaacgagctgtctgaggccatcaccaacagccctaagggcctggacagacccgtacccaagaccgagatcagcggactgattaagacgg gcgacaacttcatcacaccttccttcaaggctggatactacgatcatgtggccagcgacggcagcctgctgagctactaccagtccacagagtactt caacaacagagtcctgatgcctatcctccagaccaccaatggcaccctgatggccaacaatagaggctacgacgacgtgttcaggcaggttccttc tttctccggctggagcaacacaaaggccaccacagtgagcacaagcaataacctcacctacgacaaatggacctacttcgctgccaagggcagcc ccctctacgactcttatcctaaccactttttcgaggatgtgaaaacactggctatcgatgccaaggacatcagcgcccttaaaacaacaatcgactccg agaaacctacctacctgatcatcagaggcctgtccggcaatggcagccagctgaacgagctgcaactgcctgaaagcgtgaaaaaagtgagcct gtatggggactacaccggcgtgaacgtggccaagcagatcttcgccaatgtggtggaactggagttctacagcactagcaaggccaattctttcgg ctttaaccccctggtgctgggcagcaagacaaacgtgatctacgatctgttcgccagcaagcctttcacccacatcgacctgacacaggtgacgctg cagaacagcgacaacagcgccatcgacgccaacaagctgaagcaggccgtgggcgacatttacaactaccggagattcgagagacaatttcag ggctatttcgccggcggatacatcgacaagtatctggtcaaaaatgtgaataccaacaaggatagcgacgacgacctggtataccggtccctgaaa gaactgaacctgcacttggaggaagcctacagagagggcgacaatacctactatagagtcaacgagaactactaccctggcgcctccatctacga aaatgaacgggcctctagagactctgagttccaaaacgagatcctgaaaagagcagagcagaatggcgtcaccttcgacgagaacatcaagcgc attaccgccagcggaaagtactccgtgcagttccagaagctggagaacgataccgacagctctctggaacggatgaccaaggccgtggagggac tggtcaccgtgatcggcgaagagaagttcgaaaccgtggacatcaccggcgtgtcttctgacacaaacgaagtgaaaagcctggctaaagagctg aagacaaacgccctgggagtgaagctgaagctgtaa Artificial armY-Anthrax toxin receptor 2 fusion protein codon-optimized (for human) nucleotide sequence (2,523 bp) 62) SEQ ID NO: 62 atgtacagaatgcagctgctctcttgcattgccctgagcctggccctggtgaccaatagccaggagcaacctagctgcagaagagcctt cgacctctacttcgtgctggataagtccggcagcgtcgccaacaattggatcgagatctacaacttcgtacagcagctggccgaacgcttcgtgagc cccgagatgagactgagcttcatcgtgttctcttcccaggccaccatcatcctgcctctgaccggcgacagaggcaaaatctcaaagggcctggaa gatctgaaaagagtgtcccccgtcggcgagacatacatccacgagggcctgaagctggccaatgaacagatccagaaggccggcggactgaag accagcagcatcatcattgccctgaccgacggcaaactggacggcctggtccctagctacgccgagaaggaagccaagatcagccggagcctg ggcgcttctgtgtactgcgtgggagtgctggacttcgagcaggctcaactggagaggatcgctgatagcaaggagcaggttttcccagtgaaagg cggctttcaagccctgaaaggcatcatcaacagcatcctggcccagagctgtacagagatcctggaactccagcctagcagcgtgtgcgtcggcg aagagttccagatcgtgttaagcggcagaggcttcatgctgggcagcagaaacggcagcgtgctgtgcacatacaccgtcaatgagacctacaca acaagcgtgaagcccgtgtccgtgcagctgaatagcatgctgtgtcctgcccctatcctcaacaaggccggcgaaaccctggacgtgtccgtgtctt tcaatggcggcaagagcgtaatctccggctctctgatcgtgacagccaccgagtgcagcaacggaggcggaggcggatctggtggcggaggat cgggcggtggcggtagcaccaacctggtgaaccagtcaggctacgcccttgtggccagcggaagatccggcaacctgggctttaagctgttttcta cacagagcccatctgctgaagtgaagctgaagtctctcagcctgaacgacggctcttatcagtccgagatcgatctgagcggaggagccaatttcc gggagaagttcagaaactttgctaatgagctgagcgaagccatcacaaacagccctaagggcctggatagacctgtgcccaagaccgagatcag cggactgatcaagacaggcgacaacttcatcaccccaagcttcaaggctggctactatgaccacgtggcctctgatggatccctgctgtcttattacc agagcacagaatacttcaacaacagagtgctgatgcctatcctgcaaaccaccaatggaacgctgatggccaacaaccggggctacgatgacgtg ttcagacaggtgcctagcttcagcggatggagcaacaccaaggccacaacagtcagcacctctaacaacctgacctacgacaagtggacctacttt gccgctaagggctctccactgtacgatagctaccccaaccacttctttgaggacgtgaagacactggccatcgatgccaaagacatatctgcgctga agaccaccatcgacagcgagaagcctacatatctgatcatcagaggcttgagcggcaacgggtctcagctgaacgagcttcagctgcctgagagc gtgaaaaaggtgagcctgtacggcgactacaccggcgtgaacgtggccaagcagatcttcgctaacgtggtggaattagagttctacagcaccag caaggccaacagcttcggcttcaaccccctggtgctgggctctaagacaaacgtgatctacgatctgttcgccagcaaacccttcacccacatcgat ctgacccaggtgaccctgcagaactccgacaacagcgccatcgacgccaacaagctgaaacaggccgtgggcgacatctacaattaccggagat tcgagcggcaattccagggctactttgcgggcggctacatcgacaagtacctggtgaagaacgtgaacacgaacaaggacagcgacgacgacct ggtgtaccggagccttaaggagctgaacctgcatctggaagaagcctaccgggagggcgataacacatattaccgggtgaatgagaactactacc ctggcgccagcatctacgagaacgagagagccagcagagatagcgaattccaaaacgaaatcctgaagcgggccgagcagaacggcgtgactt tcgacgagaatattaagagaatcaccgcctccggaaagtacagcgtgcagtttcagaaactggaaaacgatacagactcaagcttggagcgcatg accaaggccgtggaaggcctggtgaccgtaatcggcgaggaaaaattcgaaaccgtggacattaccggcgtgtcttctgacaccaacgaggtga agagcctggctaaagagctgaagaccaacgccctgggcgtcaagctgaagctgtaa Artificial Protein M with radiolabel peptide tag (KGRPLVY) protein sequence (555 amino acids). Including the human IL-2 signal sequence, radiolabel tag, linker, Mycoplasma genitalium protein M 63) SEQ ID NO: 63 MYRMQLLSCIALSLALVTNSKGRPLVYGGSGGGGSTNLVNQSGYALVASGRSGNLG FKLFSTQSPSAEVKLKSLSLNDGSYQSEIDLSGGANFREKFRNFANELSEAITNSPKGLDRPVP KTEISGLIKTGDNFITPSFKAGYYDHVASDGSLLSYYQSTEYFNNRVLMPILQTTNGTLMANN RGYDDVFRQVPSFSGWSNTKATTVSTSNNLTYDKWTYFAAKGSPLYDSYPNHFFEDVKTLAI DAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLPESVKKVSLYGDYTGVNVAKQIFANV VELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKPFTHIDLTQVTLQNSDNSAIDANKLKQA VGDIYNYRRFERQFQGYFAGGYIDKYLVKNVNTNKDSDDDLVYRSLKELNLHLEEAYREGD NTYYRVNENYYPGASIYENERASRDSEFQNEILKRAEQNGVTFDENIKRITASGKYSVQFQKL ENDTDSSLERMTKAVEGLVTVIGEEKFETVDITGVSSDTNEVKSLAKELKTNALGVKLKL Substitution #1: Alanine mutagenesis (underlined “A”) of a) 494-507 amino acid (highlighted in green) and b) 527-540 amino acid (highlighted in green) predicted to be immunogenic in Protein M (469-556 amino acid). See SEQ ID NO: 1 for the original sequence (37-556 amino acids). 64) SEQ ID NO: 64 TNLVNQSGYALVASGRSGNLGFKLFSTQSPSAEVKLKSLSLNDGSYQSEIDLSGGANF REKFRNFANELSEAITNSPKGLDRPVPKTEISGLIKTGDNFITPSFKAGYYDHVASDGSLLSYY QSTEYFNNRVLMPILQTTNGTLMANNRGYDDVFRQVPSFSGWSNTKATTVSTSNNLTYDKW TYFAAKGSPLYDSYPNHFFEDVKTLAIDAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLP ESVKKVSLYGDYTGVNVAKQIFANVVELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKPFT HIDLTQVTLQNSDNSAIDANKLKQAVGDIYNYRRFERQFQGYFAGGYIDKYLVKNVNTNKD SDDDLVYRSLKELNLHLEEAYREGDNTYYRVNENYYPGASIYENERASRDSEFQNEILKRAE QNGVTFDENIKRITASGKYSVQFQALANATASALAAMTKAVEGLVTVIGEEKFETVAIAGVA SATNAVASLAKELKTNALGVKLKL Substitution #2: Alanine mutagenesis (underlined “A”) of a) 494-507 amino acid (highlighted in green) and b) 527-540 amino acid (highlighted in green) predicted to be immunogenic in Protein M (469-556 amino acid). See SEQ ID NO: 1 for the original sequence (37-556 amino acids). 65) SEQ ID NO: 65 TNLVNQSGYALVASGRSGNLGFKLFSTQSPSAEVKLKSLSLNDGSYQSEIDLSGGANF REKFRNFANELSEAITNSPKGLDRPVPKTEISGLIKTGDNFITPSFKAGYYDHVASDGSLLSYY QSTEYFNNRVLMPILQTTNGTLMANNRGYDDVFRQVPSFSGWSNTKATTVSTSNNLTYDKW TYFAAKGSPLYDSYPNHFFEDVKTLAIDAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLP ESVKKVSLYGDYTGVNVAKQIFANVVELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKPFT HIDLTQVTLQNSDNSAIDANKLKQAVGDIYNYRRFERQFQGYFAGGYIDKYLVKNVNTNKD SDDDLVYRSLKELNLHLEEAYREGDNTYYRVNENYYPGASIYENERASRDSEFQNEILKRAE QNGVTFDENIKRITASGKYSVQFAKLANATASALARMTKAVEGLVTVIGEEKFETVAIAGVA SATNAVKSLAKELKTNALGVKLKL Substitution #3: Alanine mutagenesis (underlined “A”) of a) 494-507 amino acid (highlighted in green) and b) 527-540 amino acid (highlighted in green) predicted to be immunogenic in Protein M (469-556 amino acid). See SEQ ID NO: 1 for the original sequence (37-556 amino acids). 66) SEQ ID NO: 66 TNLVNQSGYALVASGRSGNLGFKLFSTQSPSAEVKLKSLSLNDGSYQSEIDLSGGANF REKFRNFANELSEAITNSPKGLDRPVPKTEISGLIKTGDNFITPSFKAGYYDHVASDGSLLSYY QSTEYFNNRVLMPILQTTNGTLMANNRGYDDVFRQVPSFSGWSNTKATTVSTSNNLTYDKW TYFAAKGSPLYDSYPNHFFEDVKTLAIDAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLP ESVKKVSLYGDYTGVNVAKQIFANVVELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKPFT HIDLTQVTLQNSDNSAIDANKLKQAVGDIYNYRRFERQFQGYFAGGYIDKYLVKNVNTNKD SDDDLVYRSLKELNLHLEEAYREGDNTYYRVNENYYPGASIYENERASRDSEFQNEILKRAE QNGVTFDENIKRITASGKYSVQFQALEADADSALEAMTKAVEGLVTVIGEEKFETVDIAGVS ADTAEVASLAKELKTNALGVKLKL Substitution #4: Alanine mutagenesis (underlined “A”) of a) 494-507 amino acid (highlighted in green) and b) 527-540 amino acid (highlighted in green) predicted to be immunogenic in Protein M (469-556 amino acid). See SEQ ID NO: 1 for the original sequence (37-556 amino acids). 67) SEQ ID NO: 67 TNLVNQSGYALVASGRSGNLGFKLFSTQSPSAEVKLKSLSLNDGSYQSEIDLSGGANF REKFRNFANELSEAITNSPKGLDRPVPKTEISGLIKTGDNFITPSFKAGYYDHVASDGSLLSYY QSTEYFNNRVLMPILQTTNGTLMANNRGYDDVFRQVPSFSGWSNTKATTVSTSNNLTYDKW TYFAAKGSPLYDSYPNHFFEDVKTLAIDAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLP ESVKKVSLYGDYTGVNVAKQIFANVVELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKPFT HIDLTQVTLQNSDNSAIDANKLKQAVGDIYNYRRFERQFQGYFAGGYIDKYLVKNVNTNKD SDDDLVYRSLKELNLHLEEAYREGDNTYYRVNENYYPGASIYENERASRDSEFQNEILKRAE QNGVTFDENIKRITASGKYSVQFAKLANDTASSAERATKAVEGLVTVIGEEKFETVAITGASS ATNAVKALAKELKTNALGVKLKL Armoracia rusticana Horseradish peroxidase mature protein sequence (31-338 amino acids). 68) SEQ ID NO: 68 QLTPTFYDNSCPNVSNIVRDTIVNELRSDPRIAASILRLHFHDCFVNGCDASILLDNTTS FRTEKDAFGNANSARGFPVIDRMKAAVESACPRTVSCADLLTIAAQQSVTLAGGPSWRVPLG RRDSLQAFLDLANANLPAPFFTLPQLKDSFRNVGLNRSSDLVALSGGHTFGKNQCRFIMDRL YNFSNTGLPDPTLNTTYLQTLRGLCPLNGNLSALVDFDLRTPTIFDNKYYVNLEEQKGLIQSD QELFSSPNATDTIPLVRSFANSTQTFFNAFVEAMDRMGNITPLTGTQGQIRLNCRVVNSNS Escherichia coli Alkaline phosphatase mature protein sequence (22-471 amino acids). 69) SEQ ID NO: 69 RTPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDS EITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYN GALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCP GNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSD AASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLA QMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNT LVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYG PHAANVVGLTDQTDLFYTMKAALGLK Photinus pyralis Luciferase protein sequence (1-550 amino acid). 70) SEQ ID NO: 70 MEDAKNIKKGPAPFYPLEDGTAGEQLHKAMKRYALVPGTIAFTDAHIEVNITYAEYF EMSVRLAEAMKRYGLNTNHRIVVCSENSLQFFMPVLGALFIGVAVAPANDIYNERELLNSM NISQPTVVFVSKKGLQKILNVQKKLPIIQKIIIMDSKTDYQGFQSMYTFVTSHLPPGFNEYDFV PESFDRDKTIALIMNSSGSTGLPKGVALPHRTACVRFSHARDPIFGNQIIPDTAILSVVPFHEIGF GMFTTLGYLICGFRVVLMYRFEEELFLRSLQDYKIQSALLVPTLFSFFAKSTLIDKYDLSNLHE IASGGAPLSKEVGEAVAKRFHLPGIRQGYGLTETTSAILITPEGDDKPGAVGKVVPFFEAKVV DLDTGKTLGVNQRGELCVRGPMIMSGYVNNPEATNALIDKDGWLHSGDIAYWDEDEHFFIV DRLKSLIKYKGYQVAPAELESILLQHPNIFDAGVAGLPDDDAGELPAAVVVLEHGKTMTEKE IVDYVASQVTTAKKLRGGVVFVDEVPKGLTGKLDARKIREILIKAKKGGKSKL Artificial Xpress tag, a peptide recognized by an antibody 71) SEQ ID NO: 71 DLYDDDDK Artificial E-tag, a peptide recognized by an antibody (13 amino acid) 72) SEQ ID NO: 72 GAPVPYPDPLEPR Artificial FLAG-tag, a peptide recognized by an antibody (8 amino acid) 73) SEQ ID NO: 73 DYKDDDDK Artificial HA-tag, a peptide recognized by an antibody (9 amino acid) 74) SEQ ID NO: 74 YPYDVPDYA Artificial HA-tag, a peptide recognized by an antibody (9 amino acid) 75) SEQ ID NO: 75 YPYDVPDYA Artificial His-tag, 5-10 histidines bound by a nickel or cobalt chelate or antibody (6 amino acid) 76) SEQ ID NO: 76 HHHHHH Artificial Myc-tag, a short peptide recognized by an antibody (14 amino acid) 77) SEQ ID NO: 77 EQKLISEEDLLRKR Artificial S-tag, a short peptide recognized by an antibody (15 amino acid) 78) SEQ ID NO: 78 KETAAAKFERQHMDS Artificial Softag 1, for mammalian expression, a short peptide recognized by an antibody (13 amino acid) 79) SEQ ID NO: 79 SLAELLNAGLGGS Artificial VSV-tag, a peptide recognized by an antibody (11 amino acid) 80) SEQ ID NO: 80 YTDIEMNRLGK Artificial Softag 3, for prokaryotic expression, a short peptide recognized by an antibody (8 amino acid) 81) SEQ ID NO: 81 TQDPSRVG Artificial VS tag, a peptide recognized by an antibody (14 amino acid) 82) SEQ ID NO: 82 GKPIPNPLLGLDST Artificial Avi-Tag, a peptide allowing biotinylation by the enzyme BirA and so the protein can be isolated by streptavidin and/or avidin (20 amino acid) 83) SEQ ID NO: 83 MAGGLNDIFEAQKIEWHEGG Artificial SBP-tag, a peptide which binds to streptavidin (38 amino acid) 84) SEQ ID NO: 84 MDEKTTGWRGGHVVEGLAGELEQLRARLEHHPQGQREP Artificial Strep-tag (Strep-tag II), a peptide which binds to streptavidin or the modified streptavidin called streptactin (8 amino acid) 85) SEQ ID NO: 85 WSHPQFEK Escherichia coli BCCP (Biotin Carboxyl Carrier Protein), a protein domain biotinylated by BirA enabling recognition by streptavidin (73-156 amino acids) 86) SEQ ID NO: 86 PAAAEISGHIVRSPMVGTFYRTPSPDAKAFIEVGQKVNVGDTLCIVEAMKMMNQIEA DKSGTVKAILVESGQPVEFDEPLVVIE Artificial TC tag, a tetracysteine tag that is recognized by FlAsH and ReAsH biarsenical compounds (6 amino acid) 87) SEQ ID NO: 87 CCPGCC Artificial Calmodulin-tag, a peptide bound by the protein calmodulin (26 amino acid) 88) SEQ ID NO: 88 KRRWKKNFIAVSAANRFKKISSSGAL Artificial Polyglutamate tag, a peptide binding efficiently to anion-exchange resin such as Mono-Q (6 amino acids) 89) SEQ ID NO: 89 EEEEEE Rhodococcus sp./Artificial Halo-tag, a mutated hydrolase that covalently attaches to the HaloLin Resin (297 amino acid) 90) SEQ ID NO: 90 MAEIGTGFPFDPHYVEVLGERMHYVDVGPRDGTPVLFLHGNPTSSYVWRNIIPHVAP THRCIAPDLIGMGKSDKPDLGYFFDDHVRFMDAFIEALGLEEVVLVIHDWGSALGFHWAKR NPERVKGIAFMEFIRPIPTWDEWPEFARETFQAFRTTDVGRKLIIDQNVFIEGTLPMGVVRPLT EVEMDHYREPFLNPVDREPLWRFPNELPIAGEPANIVALVEEYMDWLHQSPVPKLLFWGTPG VLIPPAEAARLAKSLPNCKAVDIGPGLNLLQEDNPDLIGSEIARWLSTLEISG Escherichia coli Maltose binding protein-tag, a protein which binds to amylose agarose (27-396 amino acid) 91) SEQ ID NO: 91 KIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGP DIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKD LLPNPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVG VDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKVNY GVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPLGAVAL KSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQ TRITK Escherichia coli Nus-tag, recognized by an antibody (1-495) 92) SEQ ID NO: 92 MNKEILAVVEAVSNEKALPREKIFEALESALATATKKKYEQEIDVRVQIDRKSGDFDT FRRWLVVDEVTQPTKEITLEAARYEDESLNLGDYVEDQIESVTFDRITTQTAKQVIVQKVREA ERAMVVDQFREHEGEIITGVVKKVNRDNISLDLGNNAEAVILREDMLPRENFRPGDRVRGVL YSVRPEARGAQLFVTRSKPEMLIELFRIEVPEIGEEVIEIKAAARDPGSRAKIAVKTNDKRIDPV GACVGMRGARVQAVSTELGGERIDIVLWDDNPAQFVINAMAPADVASIVVDEDKHTMDIAV EAGNLAQAIGRNGQNVRLASQLSGWELNVMTVDDLQAKHQAEAHAAIDTFTKYLDIDEDF ATVLVEEGFSTLEELAYVPMKELLEIEGLDEPTVEALRERAKNALATIAQAQEESLGDNKPAD DLLNLEGVDRDLAFKLAARGVCTLEDLAEQGIDDLADIEGLTDEKAGALIMAARNICWFGDE A Escherichia coli Thioredoxin-tag is commonly used in expression and purification of recombinant proteins. It improves the solubility of that protein of interest. Recognized by an antibody (2-109 amino acid) 93) SEQ ID NO: 93 SDKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPCKMIAPILDEIADEYQGKLTVAKL NIDQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLA Artificial Isopeptag, a peptide which binds covalently to pilin-C protein (16 amino acid) 94) SEQ ID NO: 94 TDKDMTITFTNKKDAE Artificial SpyTag, a peptide which binds covalently to SpyCatcher protein (13 amino acids) 95) SEQ ID NO: 95 AHIVMVDAYKPTK Aequorea victoria Green fluorescent protein-tag, a protein which is spontaneously fluorescent and can be bound by antibodies (1-238 amino acid) 96) SEQ ID NO: 96 MSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPW PTLVTTFSYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTL VNRIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGIKVNFKIRHNIEDGSVQLADH YQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITHGMDELYK Artificial Allows for cleavage by TEV protease between the Gln and Ser residues (7 amino acid) 97) SEQ ID NO: 97 ENLYFQS Artificial Allows for cleavage by Thrombin protease between Arg and Gly residues (6 amino acid) 98) SEQ ID NO: 98 LVPRGS Artificial Allows for cleavage by PreScission protease between the Gln and Gly residues (8 amino acid) 99) SEQ ID NO: 99 LEVLFQGP Human C1q A-chain mature amino acid sequence (23-245amino acid) 100) SEQ ID NO: 100 EDLCRAPDGKKGEAGRPGRRGRPGLKGEQGEPGAPGIRTGIQGLKGDQGEPGPSGNP GKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQPRPAFSAIRRNPPMGGNVVIFDTVITNQEEP YQNHSGRFVCTVPGYYYFTFQVLSQWEICLSIVSSSRGQVRRSLGFCDTTNKGLFQVVSGGM VLQLQQGDQVWVEKDPKKGHIYQGSEADSVFSGFLIFPSA Human C1q B-chain mature amino acid sequence (28-253 amino acid) 101) SEQ ID NO: 101 QLSCTGPPAIPGIPGIPGTPGPDGQPGTPGIKGEKGLPGLAGDHGEFGEKGDPGIPGNP GKVGPKGPMGPKGGPGAPGAPGPKGESGDYKATQKIAFSATRTINVPLRRDQTIRFDHVITN MNNNYEPRSGKFTCKVPGLYYFTYHASSRGNLCVNLMRGRERAQKVVTFCDYAYNTFQVT TGGMVLKLEQGENVFLQATDKNSLLGMEGANSIFSGFLLFPDMEA Human C1q C-chain mature amino acid sequence (29-245 amino acid) NTGCYGIPGMPGLPGAPGKDGYDGLPGPKGEPGIPAIPGIRGPKGQKGEPGLPGHPGK NGPMGPPGMPGVPGPMGIPGEPGEEGRYKQKFQSVFTVTRQTHQPPAPNSLIRFNAVLTNPQ GDYDTSTGKFTCKVPGLYYFVYHASHTANLCVLLYRSGVKVVTFCGHTSKTNQVNSGGVLL RLQVGEEVWLAVNDYYDMVGIQGSDSVFSGFLLFPD 

What is claimed is:
 1. A conjugate comprising: a first polypeptide having a specific binding affinity for an immunoglobulin molecule, wherein the first polypeptide has at least 99% identity over its entire length to one of SEQ ID NOS: 1, 2, and 63-66; conjugated to a second polypeptide having a specific binding affinity for a pathogen, a toxin or a cancer cell.
 2. The conjugate of claim 1, wherein the second polypeptide is a protein fragment of a cellular receptor.
 3. The conjugate of claim 2, wherein the second polypeptide has at least 95% identity over its entire length to one of SEQ ID NOS: 15-36.
 4. The conjugate of claim 3, wherein the pathogen is SARS-CoV-2 virus and the second polypeptide comprises the sequence of SEQ ID NO:
 15. 5. The conjugate of claim 1, wherein the second polypeptide is a single-domain antibody.
 6. The conjugate of claim 1, further comprising a spacer between the first polypeptide and the second polypeptide.
 7. The conjugate of claim 6, wherein the spacer is a cleavable peptide having a sequence of one of SEQ ID NOS: 96-98.
 8. The conjugate of claim 1, wherein the conjugate protein bound to an immunoglobulin is capable of recruiting the C1q complement component.
 9. A fusion protein comprising: a first polypeptide having a specific binding affinity for an immunoglobulin molecule, wherein the first polypeptide has at least 99% identity over its entire length to one of SEQ ID NOS: 1, 2, and 63-66; genetically fused to a second polypeptide having a specific binding affinity for a pathogen, a toxin or a cancer cell.
 10. The fusion protein of claim 9, wherein the second polypeptide is a protein fragment of a cellular receptor.
 11. The fusion protein of claim 10, wherein the second polypeptide has at least 95% identity over its entire length to one of SEQ ID NOS: 15-36.
 12. The fusion protein of claim 11, wherein the pathogen is SARS-CoV-2 virus and the second polypeptide comprises the sequence of SEQ ID NO:
 15. 13. The fusion protein of claim 9, wherein the second polypeptide is a single-domain antibody.
 14. The fusion protein of any one of claim 9, further comprising a spacer between the first polypeptide and the second polypeptide.
 15. The fusion protein of claim 14, wherein the spacer is a cleavable peptide having a sequence of one of SEQ ID NOS: 96-98.
 16. The fusion protein of claim 9, wherein the fusion protein bound to an immunoglobulin is capable of recruiting the C1q complement component.
 17. A method for inactivating, eliminating or eradicating a bloodborne pathogen or cancer cell in a subject, comprising: obtaining a sample of blood, serum or plasma from the subject or from a donor compatible with the subject, wherein the sample comprises immunoglobulins; adding the conjugate of claim 1 or the fusion protein of claim 9 to the sample, wherein the first polypeptide binds to the immunoglobulins present in the sample, and wherein the second polypeptide has a specific binding affinity for the pathogen or cancer cell; and administrating the sample comprising the conjugate or fusion protein bound to the immunoglobulins to the subject; thereby inactivating, eliminating or eradicating the bloodborne pathogen or cancer cell in the subject.
 18. The method of claim 17, wherein the second polypeptide is a protein fragment of a cellular receptor.
 19. The method of claim 18, wherein the second polypeptide has at least 95% identity over its entire length to one of SEQ ID NOS: 15-18, 24-25, 28, and 30-34.
 20. The method of claim 19, wherein the pathogen is SARS-CoV-2 virus and the second polypeptide comprises the sequence of SEQ ID NO:
 15. 21. The method of claim 17, wherein the second polypeptide is a single-domain antibody.
 22. The method of claim 17, the fusion protein comprises a spacer between the first polypeptide and the second polypeptide.
 23. The method of claim 22, wherein the spacer is a cleavable peptide having a sequence of one of SEQ ID NOS: 96-98.
 24. The method of claim 17, wherein the conjugate or fusion protein bound to the immunoglobulins inactivates, eliminates or eradicates the bloodborne pathogen or cancer cell via recruitment of the C1q complement component.
 25. A method for neutralizing a toxin in a subject, comprising: obtaining a sample of blood, serum or plasma from the subject or from a donor compatible with the subject, wherein the sample comprises immunoglobulins; adding the conjugate of claim 1 or the fusion protein of claim 9 to the sample, wherein the first polypeptide binds to the immunoglobulins present in the sample, and wherein the second polypeptide has a specific binding affinity for the toxin; and administrating the sample comprising the conjugate or fusion protein bound to the immunoglobulins to the subject; thereby neutralizing the toxin in the subject.
 26. The method of claim 25, wherein the second polypeptide is a protein fragment of a cellular receptor.
 27. The method of claim 26, wherein the second polypeptide has at least 95% identity over its entire length to one of SEQ ID NOS: 19-29 and 35-36.
 28. The method of claim 25, wherein the second polypeptide is a single-domain antibody.
 29. The method of claim 25, the fusion protein comprises a spacer between the first polypeptide and the second polypeptide.
 30. The method of claim 29, wherein the spacer is a cleavable peptide having a sequence of one of SEQ ID NOS: 96-98. 